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REGULATION OF GENE EXPRESSION IN TOBACCO FOR MANIPULATION OF 
PLANT GROWTH AND SECONDARY METABOLISM 

CROSS-REFERENCE TO RELATED APPLICATIONS 
This application is a continuation-in-part of US Patent Application Ser, No. 60/ 132,919, filed 
May 6, 1999, now abandoned, which is hereby incorporated by reference in its entirety herein. 

FIELD OF THE INVENTION 
This invention relates to enzymes involved in alkaloid, and specifically nicotine, formation in 
tobacco plants. The invention is based, at least in part, on the nucleotide sequences encoding four 
variants of putrescine N-methyltransferase (PMT1, PMT2, PMT3, and PMT4), two variants of 
arginine decarboxylase (ADC 1 and ADC2), ornithine decarboxylase (ODC), S-adenosylmethionine 
synthetase (SAMS), a fragment of NADH dehydrogenase, and a fragment of 
phosphoribosylanthranilate isomerase. The invention also relates to proteins expressed by these 
nucleotides, promoter regions of these nucleotides, use of these promoter regions to culture 
transgenic plant cells and to produce transgenic plants, sense and antisense nucleotides 
complementary to all or portions of these nucleotide sequences, use of sense and antisense 
nucleotides to regulate gene expression, and assays using proteins involved in alkaloid formation in 
tobacco plants. 

BACKGROUND OF THE INVENTION 

I. Alkaloid Formation 

Alkaloids are one of the most diverse groups of secondary compounds found in plants and 
they are the product of a complex biosynthesis pathway (Hashimoto and Yamada, 1994; Chou and 
Kutchan, 1998; Waterman, 1998). Why plants accumulate these compounds and in so many 
different forms is not known. Moreover, for many alkaloids, the exact site of synthesis and the 
factors that control their intercellular distribution and accumulation remain to be determined 
(Hashimoto and Yamada, 1994; Kutchan, 1995; Chou and Kutchan, 1998). 

Nicotine is the most abundant alkaloid present in cultivated tobacco. Nicotine is formed 
primarily in the roots of the tobacco plant and subsequently is transported to the leaves, where it is 
stored (Tso, Physiology and Biochemistry of Tobacco Plants, pp. 233-34, Dowden, Hutchinson & 
Ross, Stroudsburg, Pa. (1972)). 

The synthesis and accumulation of nicotine and other tobacco alkaloids are known to be 
controlled by various developmental, environmental, and chemical cues. Changes in phytohormone 
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(e.g., auxin, cytokinin) levels and/or ratios as a consequence of developmental age (Hashimoto and 
Yamada, 1994; Kutchan, 1995) or by direct manipulation of plant cell culture conditions have been 
shown to affect the synthesis and accumulation of nicotine and various tobacco alkaloids (Hashimoto 
and Yamada, 1994; Hibi et al., 1994; Eilbert, 1998). Various abiotic factors (wounding, drought 
stress, pH imbalance, etc.) [Hashimoto and Yamada, 1994; Kutchan, 1998; Waterman, 1998) 1, 2, 4], 
as well as biotic factors, such as herbivory, insect feeding, and attack by various microbial and fungal 
pathogens, are known elicit increased production of nicotine and other alkaloids in the leaves of wild 
and cultivated tobacco species (Baldwin, 1989; Saito and Murakoishi, 1998; Baldwin and Prestin, 
1999). In addition, the commercial practice of topping (i.e., removal of flowering head and young 
leaves at the upper portions of the plant), results in increases in nicotine and the amount and 
complexity total alkaloids present in the leaves of Nicotiana tabacum (Hashimoto and Yamada, 
1994; Hibi et al, 1994). The factors controlling the topping-induced increase in alkaloid 
biosynthesis are not known, but likely involve a complex physiological response in the plant as a 
result of altered phytohormones and wound induced signaling (Akehurst, 1981; Hibi et al y 1994; 
Kutchan, 1998). In this regard, considerable evidence now exists indicating that a jasmonic acid 
(J A)- mediated signal transduction pathway may play a role in regulation of gene expression 
contributing to this increase in alkaloid biosynthesis (Baldwin et al, 1994, 1996,1997; Ohnmeiss et 
al, 1997; Imanishi etal 9 1998a, 19986). 

The nicotine molecule is comprised of two heterocyclic rings, a pyridine moiety and a 
pyrrolidine moiety, each of which is derived from a separate biochemical pathway. The pyridine 
moiety of nicotine is derived from nicotinic acid. The pyrrolidine moiety of nicotine is provided 
through a pathway leading from putrescine to N-methylputrescine and then to N-methylpyrroline. 
(Goodwin and Mercer, Introduction to Plant Biochemistry, pp. 488-91, Pergamon Press, New York, 
(1983)). 

Putrescine is formed in plants by one of two pathways (Chattopadhyay and Ghosh, 1998). It 
can be synthesized directly from ornithine, in a reaction catalyzed by the enzyme ornithine 
decarboxylase (ODC, EC 4.1.1.17), or formed indirectly from arginine in a reaction sequence 
initiated by arginine decarboxylase (ADC, EC 4.1.1.19). Putrescine formed by the ADC and/or ODC 
pathway serves as precursor in the synthesis of the higher polyamines, spermine and spermidine, 
catalyzed by the enzymes spermine synthase and spermidine synthase, respectively, or it is converted 
to N-methylputrescine by the action of putrescine N-methyltransferase (PMT), the first committed 
step in nicotine biosynthesis (Hashimoto and Yamada, 1994; Kutchan, 1995; Chattopadhyay and 
Ghosh, 1998). N-methyl putrescine is oxidized by a diamine oxidase and cyclized to form the 1- 
methyl-A'-pyrrolium cation, which is condensed with nicotinic acid or its derivative to form nicotine 
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(Hashimoto and Yamada, 1994). 

Putrescene is a precursor for N-methylputrescine, which then forms N-methylpyrroline. 
Conversion of putrescine to N-methylputrescine is catalyzed by the enzyme putrescine 
N-methyltransferase ("PMT"), with S-adenosylmethionine serving as the methyl group donor. PMT 
5 appears to be the rate-limiting enzyme in the pathway supplying N-methylpyrroline for nicotine 

synthesis in tobacco (Feth et ah, "Regulation in Tobacco Callus of Enzyme Activities of the Nicotine 
Pathway", Planta, 168, pp. 402-07 (1986); Wagner et al., "The Regulation of Enzyme Activities of 
the Nicotine Pathway in Tobacco", Physiol. Plant., 68, pp. 667-72 (1986)). 

10 II. TRANSGENIC PLANTS 

The methods of nicotine formation in tobacco and the genes involved have been studied both 
to better understand differential gene expression during tobacco growth and development, and also to 
discover tools useful for creating transgenic plants. For example, the regulatory sequences that 
modify protein expression in tobacco may be useful in creating transgenic tobacco or other 

15 transgenic plants. 

It has already been demonstrated that tissues of many plant species may be transformed by 
exogenous, typically chimeric, genes which are effective to stably transform cells of the tissues. For 
several species, tissues transformed in this fashion may be regenerated to give rise to whole 
transqenic or genetically engineered plants. The engineered traits introduced into the transgenic 

20 plants by these techniques have proven to be stable and have also proven to be transmissible through 
normal Mendellian inheritance to the progeny of the regenerated plants. One such desirable trait is 
the production in the plant cells of desired gene products in vivo in the cells of the transqenic plants. 
For a chimeric gene to be effective, the foreign DNA sequence containing a coding region should be 
flanked by appropriate promotion and control regions. Commonly used plant cell transcription 

25 promoters include the nopaline synthase promoter from the T-DNA of A. tumefaciens and the 35S 
promoter from the cauliflower mosaic virus. 

In order for the newly inserted chimeric gene to express the protein for which it codes in the 
plant cell, the proper regulatory signals must be present and in the proper location with respect to the 
gene. These regulatory signals include a promoter region, a 5* non-translated leader sequence and a 3' 

30 polyadenylation sequence. A promoter is a DNA sequence that directs the cellular machinery of a 
plant to produce RNA from the contiguous structural coding sequence downstream (3') to the 
promoter. The promoter region influences the rate at which the RNA product of the gene and 
resultant protein product of the gene is made. The 3 1 polyadenylation signal is a non-translated region 
that functions in 
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the plant cells to cause the addition of polyadenylate nucleotides to the 3* end of the RNA to enable 
the mRNA to be transported to the cytoplasm and to stabilize the mRNA for subsequent translation 
of the RNA to produce protein. 

Other plant cell transformation techniques are directed toward the direct insertion of DNA 
into the cytoplasm of plant cells from which it is taken up, by an uncharacterized mechanism, into 
the genome of the plant. One such technique is electroporation, in 

which electric shock causes disruption of the cellular membranes of individual plant cells. Plant 
protoplasts in aqueous solution when subject to electroporation will uptake DNA from the 
surrounding medium. Another technique involves the physical acceleration of DNA, coated onto 
small inert particles, either into reqenerable plant tissues or into plant germline cells. 

The availability of cloned nucleic acid sequences encoding an enzyme involved in alkaloid 
synthesis allows for the potential manipulation of alkaloid contents. Furthermore, the availability of 
promoters useful for expressing genes in plants allows for the creation of chimeric molecules and 
transgenic plants, which in turn result in possible native plant production of desirable proteins. 

Previously reported work discloses cloning nucleotides encoding proteins involved in the 
biosynthesis of nicotine, and isolating such proteins. Approximately twenty or more cDN As and/or 
genomic DNA fragments encoding different enzymes involved with alkaloid formation have been 
isolated (Chattopadhyay and Ghosh, 1998). For example, successful cloning of partial or full-length 
cDNA encoding ODC activity from tobacco was disclosed by (Malik et al, J. Plant Biochem. 
&Biotech. 5: 109-1 12 (1996)). Also, a relatively crude preparation of PMT (30-fold purification) has 
been subjected to limited characterization (Mizusaki et al., "Phytochemical Studies on Tobacco 
Alkaloids XIV. The Occurrence and Properties of Putrescine N-Methyltransferase in Tobacco 
Plants", Plant Cell Physiol., 12, pp. 633-40 (1971)). A process for purifying PMT is disclosed in US 
Patent No. 5,369,023, "Method of purifying putrescine n-methyltransferase from tobacco plant 
extract with an anion exchange medium", hereby incorporated by reference in its entirety herein. 
Several laboratories have reported the cloning of partial or full-length cDNAs encoding ADC (Bell 
and Malmberg ,1990; Rostogi et al., 1993; Perez-Amador et al, 1995; Nam et al., 1997; Watson and 
Malmberg, 1996). Comparisons of the amino acid sequences of ADC from various plants revealed a 
high degree of conservation among the various proteins, as well as homology to ODC (Malmberg et 
al., 1998). 

It is an object of the present invention to characterize the nucleotide and amino acid 
sequences of enzymes involved in the biosynthesis of nicotine in tobacco. It is also an object of the 
present invention to provide plant promoter regions that are capable of conferring high levels of 
transcription in rapidly dividing cells of transformed plants when coupled with a heterologous coding 
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sequence in a chimeric gene. Further, the invention is directed to chimeric genes incorporating such 
promoter regions, stable transfection of plants with these chimeric genes, and the plants and cells that 
are transfected, as well as seeds of such transfected plants. It is a further object to characterize sense 
and antisense nucleotides capable of regulating expression of genes encoding for enzymes involved 
in the biosynthesis of alkaloids. 

SUMMARY OF THE INVENTION 
Proteins involved in the biosynthesis of nicotine in tobacco N. tabacum are the subject of this 
invention. More specifically, the invention concerns four variants of putrescine N-methyltransferase 
(PMT1 , PMT2, PMT3, and PMT4), two variants of arginine decarboxylase (ADC 1 and ADC2), 
ornithine decarboxylase (ODC), S-adenosylmethionine synthetase (SAMS), NADH dehydrogenase, 
and phosphoribosylanthranilate isomerase. 

BRIEF DESCRIPTION OF THE FIGURES 
Figure 1. Genomic DNA gel blot analysis of the PMT gene family in N tabacum cv. Xanthi. 
Total genomic DNA (30 \ig) was digested with Kpnl, EcoRl, qtEcoKL and Kpnl, separated by 
agarose gel electrophoresis, and transferred to nylon membranes. The membrane was hybridized 
with a 32p_i a beled antisense strand probe covering the complete coding region of the NtPMTla 
cDNA. Identity of the hybridizing bands as determined by comparison to phage DNA digests is 
indicated. Molecular weights are given in kb. Note that Kpnl shifts only the NtPMTlb band in the 
gel blot because this restriction site is present ony in Exon 1 of NtPMTlb and not NtPMTla. 

Figure 2. Amino acid sequence alignment of N tabacum PMTs. Shown is a PILEUP alignment of 
the predicted amino acid sequences of the various tobacco PMTs. Amino acid residues that differing 
among the PMTs are shaded. NtPMTla, NtPMT2, NtPMT3, and NtPMT4 refer to the deduced 
amino acid sequences of the PMTs encoded by the NtPMTla, NtPMT2, NtPMT3, and NtPMT4 
genes, respectively, isolated from N tabacum cv. Xanthi genomic DNA; cNtPMTla is the predicted 
amino acid sequence of the A41 1 cDNA (Accession No. D28506) isolated from N. tabacum cv. 
Burley 21 by Hibi et al. (1994). The location of the exon-intron boundaries are indicated by the dark 
vertical line. The nucleotide sequences for NtPMTla, NtPMTl, NtPMT3, and NtPMT4 appear in 
GenBank under the accession numbers AF 1268 10, AF 126809, AF12681 1, and AF 1268 12, 
respectively 

Figure 5. Polyacrylamide gel electrophoresis analysis of PCR amplified genomic DNA fragments 
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encoding Exon 1 of PMT from various species of Nicotiana. PCR amplification was carried out as 
described in the Materials and Methods using Exon 1 -specific primers 1 and 2 and total genomic 
DNA isolated from//, tabacum, N. otophora, and AT. tomentosiformis. The amplification products 
were separated by electrophoresis on 6.5% polyacrylamide gels, the gels fixed, and subject to 
5 autoradiography. The amplification products isolated from N. tabacum cv. Burley 21 and N. tabacum 
cv. Xanthi were identical and only the amplication products from the reactions with N. tabacum cv. 
Burley 21 DNA are shown. Standards were generated in identical reaction conditions primed with 
plasmid DNA encoding the various PMT genes isolated in this study. 

10 

Figure 4. Nucleotide sequence alignment of the 5'-flanking regions of the K tabacum PMT genes. 
Shown is a PILEUP alignment of the nucleotide sequences upstream of the initiating methionine 
(MET) codon of the four PMT genes isolated from N. tabacum cv. Xanthi. The proposed start site 
for transcription of the NtPMTla gene is indicated by the +1 above the sequences. The TATA-box 

15 and CCAAT-box motifs are boxed. Potential transcriptional regulatory elements identified by 
MOTIF search programs are also boxed and indicated by the following abbreviations:. PAL: 
palindromic sequences; G-Box: G-Box homologous sequences; MRE: metal -responsive 
element homolog. Nucleotides identical in three or more sequences are shaded. The polyguanine- 
rich region is underlined. Numbering is indicated to the right and is relative to the proposed start site 

20 of each gene. 

Figure 5. RNA gel blot analysis of PMT transcript levels in various tissue:; Total RNA was isolated 
from various tissues of mature N. tabacum cv. Burley 21 and analyzed by gel blot analysis using a 
32 P-labeled NtPMTla cDNA coding region (Exons 2 to 8) probe capable of detecting all PMT 
transcripts. 

25 A. PMT transcript levels in various tobacco plant tissues and/or organs. 

B. Induction of PMT expression in tobacco roots following topping. Abbreviations: HP, wild-type 
(NiclNic2) Burley 21; LP, low alkaloid (niclnic2) mutant. The p-subunit of mitochondrial 
ATPase (P-ATPase) served as a control. 

30 Figure 6. Semi-quantitative RT-PCR analysis of PMT gene expression in roots of tobacco plant 
before and after topping. 

A. Shown is relative abundance of the individual PMT gene transcripts before and after topping. RT- 
PCR was carried out as described in the Material and methods using Exon 1 specific primers. 
Messenger RNA was amplified from total RNA isolated from the roots of wild-type (HP, 
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NiclNicZ) Burley 21 and low alkaloid (LP, niclnicl) Burley 21 tobacco plants. Far right lane 
represents size standards for the genes isolated by PCR amplification from plasmid DNA. The P- 
subunit of mitochondrial ATPase (P-ATPase) served as a control. 

5 B. Bar graphs showing relative expression of the individual PMT genes following topping in both HP 
and LP tobacco roots. Abbreviations: HP, wild-type (NiclNirt) Burley 21; LP, low alkaloid 
(niclnic2) mutant. 

Figure 7. The nucleotide and predicted amino acid sequences of the transcribed portions of the N. 

10 tabacum cv Xanthi NtADCl and NtADC2 genes. Shown are the complete nucleotide and predicted 
amino acid sequence of the N. tabacum cv Xanthi NtADCl gene and where it differs from the 
NtADC2 gene sequence. The dots indicate nucleotide sequence identity and the stars indicate amino 
acid sequence identity. The proposed polyadenylation signal is underlined. The sequences terminate 
at the point of polyadenylation found in the full length cDNA (Wang, 1999; AF127239).The 

15 complete nucleotide sequences for the N. tabacum cv Xanthi NtADCl (AF 127240) and NtADC2 
(AF127241) including the 5' and 3' flanking sequences appear in Genbank. 

Fig. 8. Comparison of the predicted amino acid sequences of arginine decarboxylases (ADCs) from 
various species. Shown is a PILEUP alignment of the predicted amino acid sequence of the N. 
20 tabacum cv Xanthi NtADCl gene (AF 127240) aligned to the predicted ADC protein sequences from 
N. sylvestris (AB12873), Arabidopsis thaliana (AF009647), Avena sativa (oat) (X56802), 
Lycopersicon esculentum (tomato) (LI 65 82) and Escherichia coli (M31770). Amino acid residues 
conserved among the various ADC are shaded. 

25 Fig. 9. Gel blot analysis of ADC transcript levels in the roots of wild-type and low alkaloid mutant 
Burley 21 tobacco before and after topping. Total RNA was isolated from the roots of mature wild- 
type and low alkaloid mutant N. tabacum cv. Burley 21 and analyzed by gel blot analysis using [cc- 
32 P]-dCTP labeled probes recognizing the coding region of ADC or the P-subunit of tobacco 
mitochondrial ATP synthase (Boutry and Chua, 1985). Quantitation was carried out by 

30 phosphorimaging using a Molecular Dynamics Phosphorlmager. Values were normalized relative to 
the intensities of the atp2 control band in each lane. The experiment was conducted twice with 
different total RNA samples. 
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Fig. 10. Differential expression of NtADC-1 and NtADC-2 in various tissues of tobacco. Expression 
of the NtADC-1 and NtADC-2 genes was analyzed using semi-quantitative RT-PCR and gene 
specific primers capable of discriminating between transcripts arising from the two genes. Panel A 
shows control reactions demonstrating primer specificity in the PCR reactions using plasmids 
5 containing the NtADC-1 and NtADC-2 coding sequences. The numbers above the lane refer to the 
specific primer combinations as described in the Materia and methods. Panel B shows the results of 
RT-PCR reactions using first strand cDNA synthesized from total RNA extracted from either root, 
leaf, or flowers. As a internal control, primers specific for the atp2 gene transcript were include in the 
amplification reactions. All reactions were carried out within the linear range of template 
10 amplification as determined by varying template amount, amplification time, and temperature as 
described in Riechers and Timko (1999). 

Fig. 11. Genomic DNA gel blot analysis of the ODC gene family in K tabacum. Total genomic 
DNA (30 ^g) was digested with EcoKL or HindUl, fractionated by agarose gel electrophoresis, 
15 transferred to nylon membranes and hybridized with an a- 32 P-dCTP labeled probe encoding full- 
length ODC cDNA as described in the Materials. The mobility of molecular weights standards are 
given to the right of the figure in kilobases (kb). 

Fig 12. Comparison of the nucleotide and predicted amino acid sequences of the NtODC-1 and 
20 NtODC-2 genes. Shown are the nucleotide and predicted amino acid sequences of the NtODC-1 

(AF233850) and NtODC-2 (AF233849) genes. In the figure, the complete amino acid sequence of 
the pODC2 is given and the pODC 1 sequence is given only where it differs. The start site of 
transcription is designated as +1 and the poly(A) addition site is indicated by the arrow. Within the 
relevant regions of homology, nucleotide differences between the NtODC-1 and NtODC-2 genes are 
25 in bold lettering. The proposed TATA-box, and polyadenylarion signal are shaded. 

Fig. 13. Protein sequences alignment of ornithine decarboxylases (ODCs) from various species. 
Shown is a PILEUP alignment of the predicted amino acid sequences of the N. tabacum cv. Xanthi 
pODC2 protein (AF233849) with the ODCs from N. tabacum cv. SC58 (Y 10472) and cv. BY-2 
30 (ABO31066), Lycopersicon esculentum (tomato) (AF030292), Datura stramonium (jimsonweed) 
(X87847), Saccharomyces cerevisiae (NP_012737), and humans (Homo sapiens; AAA59966). 
Amino acid residues conserved among the various ODCs are shaded. 
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Fig* 14. Gel blot analysis of ODC transcript levels in various tissues of mature tobacco plants and 
in the roots before and after topping. Total RNA was isolated from various tissues of mature N. 
tabacum cv. Burley 21 and analyzed by gel blot analysis using an a- 32 P-dCTP labeled coding region 
probes for ODC. (A) Transcript levels in various organs of wild-type tobacco: R, root: S, stem ; L, 
leaf ; SE, sepal; PE, petal; O, ovary; S, stamen; and AN, anther. (B) Transcript levels in roots of 
Burley 21 tobacco plants before and after topping. RNA gel blot analysis of the tissues-specific 
distribution and post-topping expression of transcripts encoding ODC in tobacco. As a control, the 
blots were also probed with radioactively labeled probes encoding the alkaloid biosynthesis enzyme 
putrescine N-methyltransferase (PMT) and a root specific p-glucosidase (TBG-1). 



DETAILED DESCRIPTION OF THE INVENTION 
Nucleic acid sequences have been isolated from tobacco that encode important enzymes in 
nicotine and total alkaloid formation, including PMT1, PMT2, PMT3, PMT4, ADC1, ADC2, ODC, 
and SAMS. Also identified are cDNA fragments encoding partial segments of NADH 
15 dehydrogenase and phosphoribosilanthronilate isomerase. Also identified are promoter regions for 
the nucleotides encoding PMT1, PMT2, PMT3, PMT4, and ADC2. All of these nucleic acids are 
isolated from Nicotiana tabacum L. 

"Promoter" and "promoter region" are terms used interchangeably herein to refer to a DNA 
sequence that regulates expression of a selected DNA sequence operably linked to the promoter, and 
20 which effects expression of the selected DNA sequence in cells. The term also encompasses the 
5'untranslated region that may be transcribed into mRNA but is not translated. 

"Protein", "polypeptide", and "peptide" are used interchangeably herein when referring to a 
gene product. 

In one aspect, the invention features isolated nucleic acid molecules encoding for PMT1, 
25 PMT2, PMT3, PMT4, ADC1, ADC2, ODC, and SAMS, a fragment of NADH dehydrogenase and a 
fragment of phosphoribosilanthronilate isomerase. The disclosed molecules can be non-coding (e.g. 
probe, antisense or ribozyme molecules) or can code for a functional enzyme. In one embodiment, 
the nucleic acid molecules can hybridize to the nucleic acid sequences encoding for PMT1, PMT2, 
PMT3, PMT4, ADC1, ADC2, ODC, SAMS, a fragment of NADH dehydrogenase, or a fragment of 
30 phosphoribosilanthronilate isomerase or to the complements of these nucleic acid sequences. In a 

preferred embodiment, the hybridization is conducted under mildly stringent or stringent conditions. 

In further embodiments, the nucleic acid molecule is at least 50%, 60%, 70%, 80% and more 
preferably at least 90% or 95% homologous in sequence to the nucleic acid sequences encoding for 
PMT1, PMT2, PMT3, PMT4, ADC1, ADC2, ODC, SAMS, a fragment of NADH dehydrogenase, or 
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a fragment of phosphoribosilanthronilate isomerase or to the complements of these nucleic acid 
sequences. In another embodiment, the nucleic acid encodes a polypeptide that is at least 50%, 60%, 
70%, 80% and more preferably at least 90% or 95% similar in sequence to the amino acid sequence 
of PMT1, PMT2, PMT3, PMT4, ADC1, ADC2, ODC, SAMS, the fragment disclosed herein of 
5 NADH dehydrogenase, or the fragment of phosphoribosilanthronilate isomerase disclosed herein. 

In another embodiment, the invention features isolated polypeptides, preferably substantially 
pure preparations, encoded for by the nucleic acid sequences of the invention. Particularly preferred 
are those polypeptides encoded for by the nucleic acid sequences identified by (SEQ. ID. NO. 2), 
(SEQ. ID. NO. 5), (SEQ. ED. NO. 8), (SEQ. ID. NO. 1 1), (SEQ. ID. NO. 13), (SEQ. ID. NO. 15), 

10 (SEQ. ID. NO. 18), (SEQ. ID. NO. 21), (SEQ. ID. NO. 23), (SEQ. ID. NO. 25) or (SEQ. ID. NO. 26) 
or comprising a nucleotide sequence encoding the amino acid sequence encoded by (SEQ ID NO. 3), 
(SEQ. ID. NO. 6), (SEQ ID. NO. 9), (SEQ. ID. NO. 12), (SEQ. ID. NO. 14), (SEQ. ID. NO. 16), 
(SEQ. ID. NO. 19), (SEQ. ID. NO. 22) or (SEQ. ID. NO. 24). In particularly preferred 
embodiments, the subject polypeptides can aid in regulating the production of alkaloids, particularly 

15 nicotine, in plants. In one embodiment, the polypeptide is identical to or similar to the protein 

represented by the amino acid sequences of (SEQ ID NO. 3), (SEQ. ID. NO. 6), (SEQ ID. NO. 9), 
(SEQ. ID. NO. 12), (SEQ. ID. NO. 14), (SEQ. ID. NO. 16), (SEQ. ID. NO. 19), (SEQ. ID. NO. 22) 
or (SEQ. ED. NO. 24). In a preferred embodiment, the polypeptide is encoded by a nucleic acid that 
hybridizes with a nucleic acid represented in. 

20 The polypeptides of the present invention can comprise full length proteins, such as represented 

by (SEQ ID NO. 3), (SEQ. ID. NO. 6), (SEQ ID. NO. 9), (SEQ. ID. NO. ) 2), (SEQ. ID. NO. 14), 
(SEQ. ID. NO. 16), (SEQ. ID. NO. 19), (SEQ. ID. NO. 22) and (SEQ. ID. NO. 24) , or can comprise 
one or more fragments corresponding to one or more particular motifs/domains, or to arbitrary sizes, 
e.g., at least 5, 10, 25, 50, 100, 150, or 200 amino acids in length. 

25 Another aspect of the invention features chimeric genes comprised of a promoter for the genes 

for PMT2, PMT1, PMT3, PMT4, or ADC2. Yet another aspect of the invention features chimeric 
genes or chimeric molecules comprised respectively of the functional gene encoding for or the 
protein PMT1, PMT2, PMT3, PMT4, ADC1, ADC2, ODC, SAMS, NADH dehydrogenase and/or 
phosphoribosilanthronilate isomerase. 

30 The invention also concerns isolated and purified promoter regions for tobacco Beta- 

glucosidase and their use in chimeric genes or chimeric molecules. 

Another aspect of the invention involves vectors capable of transporting another nucleic acid to 
which a vector has been linked. Preferably, the vectors comprise the nucleic acid sequences of the 
invention or their complements. 
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The invention also features transgenic plants and their seeds that include (and preferably 
express) a heterologous form of PMT1, PMT2, PMT3, PMT4, ADC1, ADC2, ODC, SAMS, NADH 
dehydrogenase and/or phosphoribosilanthronilate isomerase. The present invention also 
encompasses transgenic plants that contain in their genome a chimeric gene construction 
5 incorporating the nucleic acid encoding PMT1, PMT2, PMT3, PMT4, ADC1, ADC2, ODC, SAMS, 
NADH dehydrogenase and/or phosphoribosilanthronilate isomerase. Such transgenic plants and their 
seeds may be useful to natively produce enhanced quantities of desirable exogenous proteins, such as 
compounds useful for pharmaceutical purposes, or proteins that provide herbicide resistance. 

Another feature of the invention is the use as probes of the DNA sequences disclosed herein or 
10 oligonucleotide fragments thereof. Probes may be useful to obtain additional gene family members 
or locate homologous genes in tobacco or other plant species. Copies of related genes can be 
obtained from existing genomic libraries or the genomic libraries can be constructed. In one 
embodiment, an isolated DNA sequence comprising about a fifteen to about a twenty-five base pair 
oligonucleotide sequence identical to any consecutive about fifteen to about twenty-five base pair 
15 sequence found in the sequences of the invention is used as a probe. 

Another feature is use of the polypeptides of the invention in an assay, such as an assay to 
identify modulators of enzyme activity in plants. 

Other features and advantages of the invention will be apparent to those of skill in the art. 

The nucleotide and amino acid sequences of the invention are disclosed herein in the Sequence 
20 Listing, text, and the figures. Specific sequences of the invention are provided in the attached 
Sequence Listing and can be understood to represent promoters, nucleic acids, and proteins 
respectively relating to the following proteins: PMT2 (SEQ. ID. NOS. 1, 2, and 3); PMT1 (SEQ. ID. 
NOS. 4, 5, and 6); PMT3 (SEQ. ID. NOS. 7, 8, and 9); PMT4 (SEQ. ID. NOS. 10, 1 1, and 12); 
SAMS (SEQ. ID. NOS. 13 and 14 ); ODC (SEQ. ID. NOS. 15 and 16); ADC1 (SEQ. ID. NOS. 17, 
25 1 8, and 19); ADC2 (SEQ. ID. NOS. 20, 21, and 22); ADC1 mRNA (SEQ. ID. NOS. 23 and 24); 
NADH dehydrogenase (SEQ. ID. NO. 25); andPAI (SEQ. ID. NO. 26). If only two sequence 
identifiers are provided for a protein, those sequences represent the nucleic acid and the protein 
respectively. If three identifiers are provided, the identifiers represent promoter, genomic or cDNA 
nucleic acid, and peptide sequences, respectively. If only one identifier is provided, it represents a 
30 DNA fragment coding for the protein or portions of it. 

For other reference, the sequences may be found at the following records in GenBank at the 
following Accession Numbers, which records are hereby incorporated in their entirety herein: 
AF126810 (NtPMTl); AF126809 (NtPMT2); AF126811 (NtPMT3); AF126812 (NtPMT4), 
AF176908 (NtomPMT)(Nicotiana tomentosiformis); AF76909 (NotoPMT)(Nicotiana otophora); 
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AF127239 (ADC); AF127240 (ADC1); AF127241 (ADC2); AF127242 (ODC); AF233849 (ODC2); 
AF233850 (ODC1); and AF127243 (SAMS). 

The following experimental discussion is presented to better illustrate the invention. 
I. PMT 

The present invention features the characterization of four members of the nuclear gene family 
encoding PMT in tobacco N. tabacum. The nucleic acid sequences encoding PMT and the amino 
acid sequences for the proteins are reported herein and can also be found in the DDB J, EMBL, and 
GenBank Nucleotide Sequence Databases under the accession numbers for NtPMTJa, NtPMT2, 
NtPMT3> and NtPMT4 at AF126810, AF126809, AF12681 1, and AF126812, respectively. The 
complete coding region and immediate 5 1 - and 3 - flanking regions are characterized. 

As the discussion below shows, all four PMT genes present in the N. tabacum genome are 
expressed in the roots of wild-type plants and differentially regulated in tobacco lines expressing 
either high or low total alkaloid contents. 

Materials and Methods 

Plant materials 

Seeds of N. sylvestris, N. otophora, and N. tomentosiformis were obtained from the USDA-ARS 
national tobacco germplasm collection (Oxford, NC). N. tabacum cv. Burley 21 and JV. tabacum cv. 
Xanthi seeds were kindly provided by Glenn Collins, University of Kentucky. Tobacco plants used 
for DNA isolation were grown in a soilrvermiculite mixture in the greenhouse under natural lighting 
conditions. Plants used for RNA extraction were grown in Moltan Plus (Moltan Co., Middleton, 
TN). 

Gel blot analysis of genomic DNA 

Young leaves were collected from greenhouse grown tobacco (N. tabacum cv. Xanthi) plants and 
total genomic DNA was isolated from freshly-harvested tissues using a modification of the CTAB 
extraction method (Dellaporta et a/., 1983). Approximately 30 ug of total DNA was digested with 
£coRI, Kpnl, or EcdRL and Kpnl and the digestion products separated by electrophoresis through a 
0,75% agarose gel. Restricted and size-fractionated DNA was denatured and transferred to Nytran+ 
nylon membranes (Schleicher and Schuell, Keene, NH) by capillary blotting in 0.4N NaOH 
overnight. Membranes were prehybridized in 0.25M NajHPC^ (pH 7.4), 7% SDS, 1 mM Na^DTA 
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for at least 2 hr, then hybridized overnight at 65 °C in the same buffer with 2-3 x 10 6 cpm/mL of a 
32 P-labeled single-stranded probe (antisense DNA strand). The probe was prepared by the method of 
Bednarczuk et al. (1991) using a primer (Table 1, primer 4) designed from the 3* end of the 
NtPMTla coding region (Exon 8) and the full-length coding region of the NtPMTla cDNA as 
template. The NtPMTla cDNA was generated by RT-PCR using synthetic oligonucleotide primeTS 
based on the N- and C-terminal sequences of the A41 1 cDNA reported by Hibi et al (1994) and 
RNA template isolated from N. tabacum cv. Burley 21 roots. Membranes were washed at a final 
stringency of 0. 1 x SSC, 0. 1% SDS at 65 °C. Hybridizing bands were visualized by autoradiography 
and/or imaged using a Molecular Dynamics Phosphorlmager (Model 445 SI, Sunnyvale, CA). 

Genomic library construction and phage isolation 

A library of N tabacum cv. Xanthi genomic DNA fragments constructed in EMBL3 was purchased 
from Clontech (Palo Alto, CA) and a total of 1.1 x 10 6 recombinant phage were screened by plaque 
hybridization using random-primed 32p.i a beled NtPMTla cDNA as probe (Sambrook et al, 1989). 
Prehybridization, hybridization, and washing conditions were as described above. Positive 
hybridizing phage were plaque purified by subsequent rounds of rescreening and DNA was prepared 
from 18 independently isolated phage. The phage DNA was characterized by restriction analysis and 
DNA gel blot analysis and fragments containing the sequences encoding PMT were subcloned into 
pBluescript KS vectors for further analysis. 

Comparison of the hybridizing fragments present in the 18 recombinant phage to the 
hybridization pattern obtained by genomic DNA blot analysis indicated that only three of the PMT 
genes suspected to be present in the N tabacum genome were recovered from the library screen. To 
obtain sequences encoding NtPMTla, a subgenomic library was constructed from N tabacum cv. 
Xanthi DNA. The library consisted of gel-purified 2.5-3.5 kb EcoRl fragments ligated into A,_ZAP 
II vector arms and packaged using Gigapack III Gold packaging extracts according to the 
manufacturer's instructions (Stratagene, La Jolla, CA). The primary library was amplified once in E. 
coli XLl-Blue MRF' strain (Stratagene) and screened as described above, except that a random- 
primed 32 P-labeled NtPMTla cDNA Exon 1 -specific probe was used (Table 1). Exon 1 had 
previously been amplified by PCR using primers 1 and 2 (Table 1) and the NtPMTla cDNA as 
template. The recombinant phage that hybridized with the probe was isolated from the sublibrary by 
two more rounds of plaque purification, and the pBluescript phagemid containing the approximate 
3.1 kb EcoKL genomic fragment with the NtPMTla gene was excised from the XJLAP II phage 
vector using the in vivo excision protocol described by Stratagene. 
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DNA sequence analysis 

Unless otherwise noted, DNA sequencing was performed with double-stranded plasmid DNA 
5 templates using fluorescent dye terminator technology (dRhodamine Terminator Cycle Sequencing 
Ready Reaction kit) on an ABI 3 10 DNA sequencer (Perkin-Elmer Applied Biosystems). For 
analysis of PCR products, following electrophoretic separation of amplification reaction products, 
the bands of interest were excised from the polyacrylamide gels, the DNA extracted using the 
Quiagen Gel Extraction Kit, and the recovered DNA used as sequencing template. Sequencing was 

10 performed using AmpliTaq DNA polymerase and fluorescent dye terminator technology (as 

described above) and primers 1 and 2 (Table 1) specific for Exon 1. Nucleotide and amino acid 
sequences were analyzed and aligned using either the Clustal method and Lasergene software 
(DNAStar Inc., Madison, WI) or the PILEUP and ALSCRIPT (Genetics Computer Group, Madison, 
WI) sequence analysis package (Version 9.0). Transcription factor binding site homologies were 

15 identified in promoter DNA sequences by searching the transcription factor database using the GCG 
program. 

RNA gel blot analysis 

20 For RNA analysis, roots and other tissues were harvested from mature wild-type (HP; NiclNic2) and 
low alkaloid mutant (LP; niclnicl) Burley 21 tobacco plants. For topping experiments, the stem was 
cut and the top one-third of the plant was removed just prior to flower opening. Roots were 
harvested just prior to topping (0 hr control) and at various times after decapitation. The tissue was 
immediately frozen in liquid nitrogen and stored at -80 °C until RNA extraction and isolation. 

25 Total RNA was isolated from vegetative organs and floral structures of HP and LP Burley 21 

tobacco using the TRI-reagent (Molecular Research Center Inc., Cincinnati, OH) and quantified 
spectrophotometrically by measuring A260. Total RNA (5 jig) was electrophoresed through 1.2% 
agarose gels (containing 0.4 M formaldehyde) and transferred to Nytran+ nylon membranes. 
Following prehybridization the membranes were hybridized with a single-stranded NtPMTla cDNA 

30 antisense probe (corresponding to the antisense strand of Exons 2 to 8 of the NtPMTla cDNA coding 
region) as described above. As a control to quantify and normalize RNA levels in each lane, the blot 
was hybridized with a 400-bp probe derived from the P-ATPase cDNA using primers 6 and 7 (Table 
1) as described below. 
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Semi-quantitative RT-PCR analysis of individual PMT transcript levels 

Total RNA (1 |ig) extracted from the roots of HP and LP Burley 21 tobacco plants was reverse- 
transcribed into first-strand cDNA at 42 °C using Superscript II reverse transcriptase (Gibco BRL) 
5 according to the manufacturer's protocol. Two gene-specific primers were employed in the reactions: 
primer 5 capable of recognizing Exon 3 of the PMT genes and primer 8 specific for Exon 8 of the 
nuclear gene encoding the P-subunit of mitochondrial ATPase from N. plumbaginifolia (NpATP2J) 
and N. sylvestris (NsATP2.1) (Boutry and Chua, 1985; Lalanne et at., 1998). The P-ATPase 
transcript served as an internal reference (constitutively-expressed control) to determine loading 

10 accuracy and to normalize expression levels (Kinoshita et al. t 1992) Following first strand cDNA 
synthesis, two sets of nested primers (0.4 each primer) were used to amplify the PMT and p- 
ATPase transcripts: primers 1 and 2 (Table 1) recognized Exon 1 in all five PMT transcripts and 
gave products ranging in size from 220 bp to 420 bp and primers 6 and 7 amplified an approximately 
400-bp region encompassing a portion of Exons 6 to 8 of the P-ATPase coding region. 

15 Amplification was carried out for 25 cycles using the following reaction conditions: denaturation at 
95°C for 1 min, primer annealing at 60°C for 35 sec, and extension at 72°C for 1.5 min; a final 
extension was conducted at 72°C for 6 min. Amplification products were radioactively labeled by 
spiking the PCR reaction with 10 jiCi 32P-dCTP. Aliquots of the PCR reaction were analyzed on a 
6.5% non-denaturing poly aery lamide/ IX TBE gel and electrophoresed at 600 volts. The reaction 

20 conditions were optimized to provide amplification of both PMT and $-ATPase transcripts in the 
linear range of the reaction by varying the levels of first strand cDNA template, annealing 
temperature, and number of cycles of amplification as described in Kinoshita et al. (1992). 
Molecular weight standards were prepared by PCR amplification using the same primers and 
protocol described above and plasmid DNA templates containing the PMT encoding genomic 

25 fragments, as well as genomic DNA from the various Nicotiana species indicated in the text. 

Following electrophoresis, the polyacrylamide gels were fixed in 5% MeOH, 7.5% acetic acid 
for 30 min, dried overnight, and used to expose X-ray film. PMT band intensities were quantified 
using phosphorimager analysis (Molecular Dynamics) and normalized relative to the intensities of 
the P-ATPase control band in each lane. The experiment was conducted twice with different total 

30 RNA samples, and representative results are presented from one of the two experiments. 

Results 



PMT gene structure and organization in N. tabacum 
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Gel blot analysis of total genomic DNA isolated from N. tabacum cv. Xanthi, hybridized with a 
radioactively-labeled cDNA (NtPMTla) encoding the complete coding region of putrescine N- 
methyltransferase (PMT) showed the presence of five major hybridizing bands in Kpril or EcoKL 
digested DNA, consistent with the presence of a small multigene family in the N tabacum genome 
5 (Figure 1). 

As part of our initial characterization of the gene family encoding PMT in N. tabacum, an 
EMBL3 genomic library, prepared from N. tabacum cv. Xanthi DNA, was screened using the 
NtPMTla (A4\ 1 homologous) cDNA as probe. From a total of 1 8 recombinant phage isolated, three 
phage were recovered that contained genomic fragments encoding the NtPMT2 y NtPMT3 and 

1 0 NtPMT4 genes. The three PMT genes were completely encoded within a unique sized EcoKL 

fragment within the phage DNA insert which allowed for the correlation of each with a hybridizing 
restriction fragment on the gel blot ofN tabacum genomic DNA (Figure 1). The complete coding 
region and immediate 5 1 and 3' non-coding sequences of the three genes were determined and found 
to encode full-length PMT proteins (Figure 2). Each PMT gene consisted of 8 exons and 7 introns, 

15 consistent with the gene structure reported previously for the PMT genes from N sylvestris 

(Hashimoto et ah, 1998a). Comparison of the deduced amino acid sequences (Figure 2) revealed 
that the encoded PMT proteins were extremely similar over their entire length, with the only 
significant variability in primary sequence localized to the extreme N-terminal region of the protein. 
This region, completely encoded within Exon 1 , contains a variable number of an 11 amino acid 

20 repeat with a consensus sequence of NGHQNGTSEHQ. The function of the repeated sequence is 

unknown, but is apparently inconsequential to enzyme function, since the number of repeats does not 
influence activity and PMTs characterized from other species do not contain the repeated element 
(Hashimoto etal, 1998a; Suzuki etal, 1999a). 

Multiple rounds of screening of the EMBL3 genomic library failed to yield additional 

25 hybridizing phage containing sequences encoding the other two PMT genes thought to be present in 
the N. tabacum genome and, therefore, a directed cloning approach was pursued using a subgenomic 
library constructed from EcdRI fragments isolated from N. tabacum cv. Xanthi. From this 
hybridization screening, a phage containing the approximately 3.1 kb EcoRl fragment encoding 
NtPMTla was recovered. The coding region of the NtPMTla gene was found to be identical to the 

30 A41 1 cDNA (Hibi et ai, 1994), with the exception of a single base change in Exon 6 that results in a 
conservative amino acid substitution. This difference could be the result of minor differences among 
cultivars used in the two studies (i.e., Xanthi vs. Burley 21). Translation ofthe open reading frame 
contained in NtPMTla showed that it encoded a protein containing four N-terminal 1 1 amino acid 
repeats, similar to Exon 1 of the PMT gene present in N tomentosiformis (Hashimoto et aL, 1998a). 
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Given the observation that NtPMTla encoded a homolog of the PMT gene present in K 
tomentosiformis, the nature and possible evolutionary origin of the remaining PMT gene present in 
the N. tabacum genome was brought into question. From our expression studies (described in detail 
below), we had determined that five distinct PMT encoding transcripts were present in the roots of N. 
5 tabacum, four of which could be accounted for based upon the length of the Exon I coding region in 
the four PMT genes isolated and characterized in our studies described above. The fifth transcript 
was similar in size to that encoded by NtPMTla and, therefore, was designated NtPMTlb. Since the 
variability in PMT gene structure is primarily localized within Exon 1, we used a PCR-based strategy 
to analyze the PMT gene structure and family size in N otophora, the other possible progenitor of N. 

10 tabacum. As shown in Figure 3, five distinct PCR products were detected in the electrophoretic 
pattern of amplification products generated from N. tabacum genomic DNA using Exon 1 specific 
primers (Table 1). Consistent with our studies described above and the previous work of Hashimoto 
et al (1998a), three PCR products were detected in the electrophoretic pattern of amplification 
products generated from N. sylvestris genomic DNA, and a single band was recovered from N. 

15 tomentosiformis genomic DNA. Amplification of genomic DNA from N otophora using Exon 1 
specific primers also yielded only a single band, whose electrophoretic mobility was most similar to 
that of the NtPMTlb derived product. 

Analysis of PMT gene intron and flanking sequences 

20 

The location of the seven introns within the protein coding region of the five PMT genes in N. 
tabacum is identical and appears to be conserved among PMT genes from different Nicotiana 
species. There is also little variation in the nucleotide sequences that comprise the Exon-Intron 
splice junctions in the various PMT genes in N tabacum (Table 2). The high degree of nucleotide 

25 sequence similarity recognized among PMT genes within their coding regions is also present within 
their introns and immediate 5' and 3' flanking sequences (Table 2 and Figure 4). In general, a greater 
level of sequence identity is found in the introns of the NtPMT2 t NtPMT3, and NtPMT4 genes, than 
in pair- wise comparisons among the introns of the other members of the N. tabacum PMT gene 
family. The observed conservation in the intron sequences of the NtPMT2, NtPMT3 t and NtPMT4 

30 genes is consistent with their origin from the same progenitor species (N. sylvestris). One interesting 
exception occurs within Intron 6, where the length of the intron and the sequence similarity is more 
conserved between NtPMTla and NtPMT4 y than between NtPMTA and NtPMT2 or NtPMT3. 

Approximately 1 kb of nucleotide sequence was determined 5' to the coding regions of the 
NtPMTla, NtPMT2, NtPMT3, and NtPMT4 genes (Figure 4). By comparison to the 5'-untranslated 
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region (UTR) present in the A41 1 cDNA, we set the start site for transcription initiation at 
approximately 57 nucleotides upstream of the MET start codon in NtPMTla and NtPMT3, and either 
69 or 60 nucleotides upstream in NtPMT2 and NtPMT4. The major distinguishing feature between 
the S'-UTRs in the various genes is the presence or absence of a 17 bp sequence in the gene. An 
5 appropriately placed TATA-box can be easily recognized 45 bp 5* to the initiation site in all four 

genes. Within the first 200-250 bp upstream of the TATA box, a high level of sequence conservation 
is found to exist among the promoter regions in the four genes. After this point, a clear difference 
can be observed between the NtPMTla promoter and the remaining three genes, and by 400 bp 
upstream, little similarity can be found among any of the gene family members. 

1 0 Analyzing the proximal regions of the various PMT promoters with various motif scanning 

software identified several G-box-like sequences (Foster et aL, 1994; Kim et aL, 1992; Menkens et 
aL, 1995; Staiger et aL, 1989; Williams et aL, 1992) at various positions among the PMT promoters, 
and a potential metal response element (MRE) (positions -75 to -66; numbering relative to the 
NtPMTla promoter sequence) in three of the four PMTs (Cizewski-Culotta and Hamer, 1989; Thiele, 

15 1992). An unusual 17 nucleotide stretch of guanine occurs at positions -259 to -243 in the NtPMTla 
gene promoter followed upstream by a purine-rich region (positions -332 to -263). In the NtPMT3 
promoter a 14 bp palindromic sequence (positions -497 to -484) was detected. PMT gene expression 
has been reported to increase in root tissues following treatment with methyl jasmonate (Imanishi et 
aL, 1998). However, none of the sequence motifs reported to confer methyl jasmonate- 

20 responsiveness in other plant genes (Mason et aL, 1993; Rouster et aL, 1 997) were detected in the 
PMT promoters. 

Comparison of the available nucleotide sequence information from the 3 '-flanking regions of 
the various PMT genes in N. tabacum revealed that the 3'-UTRs in the NtPMT2, NtPMT3, and 
NtPMT4 genes of N tabacum share approximately 81-94% identity with each other and are 

25 essentially identical to those reported for N sylvestris PMTs by Hashimoto et aL (1998a). The major 
distinguishing feature among the various genes is the presence of two short (20 bp and 4 bp) 
deletions in the NtPMT2 gene, which lowers the percent identity. The 3 , -UTR of NtPMTla is 
identical to that reported for the A41 1 cDNA (Hibi et aL, 1994) and 81-94% identical to the other 
PMT genes in the N. tabacum genome. Unfortunately, no sequence information is currently 

30 available for the 3'-UTR of the N. otophora or jV. tomentosiformis PMT genes. 

Regulation of PMT gene expression 



To determine whether the members of the PMT gene family in N. tabacum were differentially 
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expressed, a series of experiments were carried out to define the temporal and spatial distribution of 
transcripts arising from the five genes. Shown in Figure 5A are the results of gel blot analysis of 
total RNA extracted from various tissues of mature Burley 21 tobacco plants hybridized with 
radioactively-labeled probe capable of detecting all five PMT transcripts. Consistent with previous 
5 studies (Hashimoto et al> 1998b; Hibi et a!., 1994), PMT expression is localized exclusively to roots. 
When maturing wild-type (HP) Burley 21 plants are topped (i.e., the floral meristem and upper 1/3 of 
the stem are removed), a dramatic increase in PMT transcript abundance is observed within 2 hr, 
reaching a maximal level of accumulation by 12-24 hr. Two size transcripts are detected on the gel 
blots, reflecting the small difference in message size that occurs as a result of the difference in size of 

10 Exon 1 among the genes. 

In addition to examining PMT gene expression in wild-type plants, we also examined 
expression in a low nicotine-producing (LP) mutant of Burley 21 (Legg and Collins, 1971). The low 
nicotine Burley 21 line harbors mutations at two independent loci (nicJ and nic2) thought to be 
global regulators of gene expression involved in alkaloid formation. As shown in Figure 6B, topping 

15 of the low nicotine mutant (niclnic2) Burley 21 did not cause an increase in PMT transcript 

abundance as observed in wild type plants. Thus, it appears that Nicl and Nic2 are likely involved in 
regulation of PMT expression in the very least, and may also be involved in the regulation of other 
genes in the alkaloid biosynthetic pathway. Whether this is a direct effect (e.g., transcriptional 
activation) or indirect remains to be determined. 

20 In order to determine the extent to which the individual members of the gene family 

contributed to the general pattern of expression described above, a semi-quantitative RT-PCR 
strategy (Kinoshita et al., 1992) was used to detect and quantify the levels of the individual PMT 
transcripts in the roots of both wild-type (HP) and low alkaloid (LP) Burley 21 tobacco. This 
approach takes advantage of the fact that Exon 1 is variable in length among the various PMT genes 

25 (Figure 2), allowing for their individual detection and quantitation following polyacrylamide gel 
electrophoresis and autoradiography. 

Five RT-PCR products (representing Exon 1 from each of the five genes present in N. tabacum) 
were detected in the electrophoretic profiles of amplification products derived from reactions using 
either HP or LP Burley 21 root RNA (Figure 6 A). All five PMT genes present in the N. tabacum 

30 genome were expressed in the roots of wild-type plants, and topping resulted in a differential 

accumulation of transcripts derived from each gene. Among the five genes, transcripts derived from 
the NtPMT2 and NtPMTlb showed the greatest increase in abundance rising approximately 3-fold 
during the first 24 hr post-topping, whereas levels of the NtPMTla and NtPMT4 transcripts changed 
little in response to topping (Figure 6B). In the LP mutant, little or no effect was observed on the 
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levels of the various PMT transcripts following topping, although in some cases (e.g., NtPMTla) a 
small but likely insignificant depression in transcript abundance was detected. Thus, it appears that 
all five genes contribute to PMT activity levels within the root. 

5 II. ADC 

The present invention features the characterization of two members of the nuclear gene family 
encoding ADC in tobacco N. tabacum L. As the following discussion shows, ADC2 is preferentially 
expressed in roots and accounts for the major portion of ADC transcripts present. Furthermore, 
analysis of ADC transcript levels in roots of low and high nicotine producing lines showed that ADC 
1 0 expression is under the control of the Nicl Nic2 regulatory loci. 

Materials and methods 

Plant growth and tissue preparation 

15 

Seeds of N. tabacum cv. Xanthi, wild-type and low alkaloid nicl nicl mutant N. tabacum cv. Burley 
21 were obtained from Dr. G. Collins (University of Kentucky, Lexington), Tobacco plants used for 
DNA isolation were grown in soihvermiculite mixture in the greenhouse under natural lighting 
conditions. Plants used for RNA extraction were grown either in Moltan Plus (Moltan Co., 

20 Middleton, TN) or hydoponically in a dilute (half-strength) Peters nutrient solution with continuous 
aeration of the roots under natural lighting conditions in the greenhouse. Topping experiments were 
conducted by removing the floral meristem, leaves and stem (approximately the upper 1/3 of the 
plant) from tobacco plants just prior to blooming. Plant tissues were collected from fully matured 
individuals, frozen in liquid nitrogen, and stored at -80 °C until used for RNA preparation (see 

25 below). 

Screening of genomic libraries and phage characterization 

A genomic library constructed in X EMBL3 from N, tabacum cv. Xanthi leaf DNA (Clonetech, Inc., 
30 Palo Alto, CA) was screened by plaque hybridization (Sambrook et al, 1989) using an [a- 32 P]- 

dCTP-Iabeled, 2.7 kb EcoRI-Xhol fragment from plasmid PR24 as probe. PR24 encodes a full length 
ADC cDNA isolated from the roots of wild-type N. tabacum cv. Burley 21 (Wang, 1999). 
Hybridization was performed at 65°C for 16 h in a solution containing 0.25 M Na^C^ (pH 7.2) 
and 7% (w/v) SDS. Following hybridization, the membranes were washed twice in 2 x SSC, 0.1% 
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SDS for 15 min at room temperature, once in 0.2 x SSC, 0.1% SDS for 30 min at 65 °C. Hybridizing 
phage were picked and plaque purified through three subsequent rounds of hybridization screening. 
Phage DNA was isolated from plaque purified phage using a Qiagen Phage Midi Preparation Kit 
(Qiagen, Germany) and insert DNA characterized by restriction mapping and DNA gel blot analysis. 
The relevant hybridizing bands in each phage were cloned into pBluescript SK+ vectors for further 
analysis. 

Nucleic acid sequencing and analysis 

Nucleotide sequencing was carried out manually using the Sequenase Version 2.0 protocols 
according to the manufacturer's protocol (United States Biochemical, Cleveland, OH) or with an ABI 
310 Genetic Analyzer (PE Applied Biosystems, Foster City, CA) using double-stranded plasmid 
DNA templates prepared utilizing the Qiaprep Spin Plasmid Kit (Qiagen USA, Valencia, CA). The 
nucleotide and predicted amino acid sequences of the various cDNAs were analyzed using BLAST 
sequence analysis programs (Altschul et al, 1990; Gish and States, 1993) and protein sequence 
alignments were carried out using the PELEUP program (Genetics Computer Group Sequence 
Analysis package, Version 9.0 (GCG, University of Wisconsin, Madison, WI) and the various gene 
sequences available in the NCBI (National Center for Biotechnology Information, Bethesda, MD) 
nucleotide and protein sequence database. Manual adjustment of the sequence alignments were 
carried out as necessary. 

RNA isolation and gel blot analysis 

Total RNA was extracted from tobacco roots, leaves, and floral parts using Tri-Reagent 
(Molecular Research Center, USA, Cincinnati, OH) according to the manufacturer's protocol. For 
RNA gel blot analysis, aliquots (10 ng) of total RNA extracted from the various tissues were 
fractionated by electrophoresis through a 1 .2% agarose-formaldehyde gel and blotted onto Nytran 
nylon membranes (Schleicher & Schuell, Keene, NH) using 10 X SSC. The transferred RNA was 
UV cross-linked to the membrane using a UV Stratalinker (Stratagene, La Jolla, CA) and the 
membranes were prehybridized in 7% SDS, 0.25 M NajHPO^ pH 7.2 for 2-4 hours at 65 °C. 
Hybridization was carried out in the same buffer in the presence of 32 P-labeled probes for 16 hr at 
65 °C. The membranes were washed under high stringency conditions and subject to 
autoradiography at -80°C for approximately 48 h. 

For gel blot analysis, [a- 32 P]-dCTP -labeled probes were prepared by random primed labeling 
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(Random Primed Labeling Kit, Boehringer Mannheim, Indianapolis, IN) using 25-50 ng of a 2.7 kb 
EcoRl-Xhol fragment derived from PR24 and a 460 bp fragment amplified from the [3- subunit of the 
tobacco mitochondrial ATP synthase gene (atpl) (Boutry and Chua, 1985). 

5 Semi-quantitative RT-PCR analysis of NtADCl and NtADC2 transcript levels. 

Total RNA (2 ng) from roots, leaves, or floral parts was reverse transcribe at 40°C for 1 h in a 
reaction cocktail containing 200 units of Superscriptll reverse transcriptase (RNase H-, Gibco BRL, 
USA), 10 uriits RNase inhibitor (Perkin Elmer), 200 \im dNTPs and 40 pmol of primer, in total 

10 volume of 20^1. For first strand cDNA synthesis, a single primer [5'- 

AGAAAAACATCACCAACT-3 '] capable of hybridizing to both the ADC1 andADCl transcripts 
was used in the reaction. As a control, a primer ( 5 '-GCAACTGTC ATCTTATCATCTTC-3 ') 
specific for the P-subunit of the tobacco mitochondrial ATP synthase gene apt! (Boutry and Chua, 
1985) was used in the reverse transcriptase reaction. 

1 5 Following reverse transcription, the single stranded cDNA products were serially diluted over a 

concentration range between 1 to 50 ng RNA, and PCR amplification was carried out for 25 cycles 
of 45 s at 94°C, 1 min at 64°C and 1 min at 72°C in a Genemate thermocycler (ISC Bioexpress, 
UT). The reaction mixture contained cDNA template, 1 x PCR buffer (Boehringer Mannheim), 100 
\xM dNTPs, 25 pmol of each forward and reverse primer and 1 unit Taq DNA polymerase. The PCR 

20 reactions specific for ADC1 transcripts contained the following primers: ADCl-forward, 5'- 
CGTAGACGCTACTGTTTC-3 ' and ADC 1 -reverse, 5'-TGGACAAC TGTGGAGGCG-3 \ 
Reactions specific for ADC2 transcripts contained primers ADC2-forward, 5'- 
TGTAGATGCTGCTGTTGTTT-3 and ADC2-reverse, 5'-TGAACAAC TGCGGAGGCA-3 ' . 
Control reactions for normalization of amplification products contained 25 pmol of primers specific 

25 for the tobacco apt2 transcripts: atp2 forward, 5 '-GTATATGGTCAAATGAATGAGCC-3', and atp2 
reverse.int, 5 '-GCAGTATTGTAGTGATCCTCTCC-3 1 . For quantitation purposes, amplification 
reactions were supplemented with l^iCi 32 P-dCTP. PCR products were separated by electrophoresis 
through 1.2% agarose gels, the fractionated reaction products transferred onto a Hybond N+ 
membranes, dried and subject to autoradiography at -70° C. Quantitation was carried out by 

30 phosphorimaging using a Molecular Dynamics Phosphorlmager. Values were normalized relative to 
the intensities of the atp2 control band in each lane. The experiment was conducted twice with 
different total RNA samples, and representative results are presented from one of the two 
experiments. 
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Results 

These studies show the structure and expression of individual members of the ADC gene 
family in tobacco. An cc- 32 P-dCTP-iabeled 2.7 kb EcoRI-XhoI fragment from PR24 encoding the 
ADC coding region was used to screen an k EMBL3 phage genomic library. From a screen of 
approximately 3 X10 5 phage, seventeen hybridizing phage were recovered, of which five were fully 
characterized by restriction mapping and DNA gel blot analysis. These phage fell into two groups 
based on their restriction profile. The relevant hybridizing fragments from the various phage were 
cloned into pBluescript and their nucleotide sequence determined. 

Presented in Figure 7 are the nucleotide and predicted amino acid sequences of NtADC-1 and 
NtADC-2 genes. Both genes contain a single open reading frame, uninterrupted by introns. The 
nucleotide and amino acid sequence encoded in NtADC-1 is identical to that of PR24, the full length 
cDNA isolated from N. tabacum cv Burley 21. There are 84 nucleotide differences within the 
NtADC-1 and NtADC-2 coding regions, resulting in 23 amino acid differences between the ADC1 
and ADC2 proteins, respectively. The ADC1 protein is one amino acid shorter in length, missing 
Val-13. 

By comparison to the full-length cDNA, the 5 '-untranslated region (UTR) present in 
NtADC-1 and NtADC-2 are 43 1 bp and 432 bp long, respectively. The size of the 5'-UTR in the 
ADC transcripts is considerably larger than the average size of the plant leader sequence (Joshi, 
1987). In contrast, the 3'-UTRs present in NtADC-1 and NtADC-2 are relatively short, 
approximately 84 nucleotides in length. In both gene sequences, a conserved polyadenylation signal 
(AATAATA) can be recognized 23 nucleotides from the site of polyadenylation site found in the 
PR24 cDNA. 

. Pairwise comparison .of the N. tabacum ADC1 and ADC2 proteins with the ADCs of other 
plant species showed that the N. tabacum proteins are approximately 82% identical to the ADC of 
its evolutionary progenitor species N. sylvestris [Genbank Accession No. ABO 12873] and 86% 
identical to the ADC from tomato (Lycopersicon esculentum) [31], another member of the 
Solanaceae family (Figure 2). As might be expected, the N, tabacum ADC shares considerably less 
similarity to ADCs isolated from species more distantly related evolutionarily, such as Arabidopsis - 
67% identical [32, 33], soybean- 67% identical [34], and oat - 42% identical [35] and is only 29% 
identical to the enzyme from Escherichia coli - [36]. 

The predicted protein coding regions for the N. tabacum ADCs are substantially longer than 
those reported for the ADC proteins of N. sylvestris and L. esculentum [31], but are similar in length 
to those reported in Arabidopsis, oat, soybean [32-35] and for the E. coli enzyme [36]. The 
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difference in overall length appears to arise from an apparent nucleotide deletion in the N. sylvestris 
and tomato cDNA sequences relative to the ADC1 and ADC2 predicted sequence and those in other 
plants. In the nucleotide sequences reported for both the N. sylvestris and tomato cDNAs, a guanine 
residue (position 2295 in the N. sylvestris sequence and 1531 in the tomato sequence) is missing 
5 [Genbank Accession No. ABO 12873]. This deletion changes the reading frame and introduces a 
premature termination to the predicted coding region. Using the sequence information available in 
the NCBI database, correcting for this error allowed us to extend the predicted C-terminus of the 
both ADC proteins, yielding the alignment to the N, tabacum ADCs and those of other plant ADCs 
as indicated in Figure 8, We have also included in the alignment shown in Figure 8, the correction at 
10 the N-terminus of the predicted tomato ADC protein sequence noted by P6rez-Amado et al. [37], 
allowing better alignment of all of the higher plant sequences. 

Developmental regulation of arginine decarboxylase expression 

1 5 It has been shown that nicotine formation can be activated in the roots of maturing tobacco 

plants by topping, that is, removal of the flower head and several young leaves (Akehurst, 1981 ; 
Hibi, et al., 1994). Coincident with the activation of nicotine formation, there is an increase in the 
levels of transcripts encoding ODC, PMT and spermidine synthase (SPS) over the subsequent 24 hr 
period in wild-type plants (Hibi et al., 1994; Riechers and Timko, 1999). To determine the effects of 

20 topping on ADC expression in roots, Burley 21 plants were grown in the greenhouse to the bud stage 
at which point the upper 1/3 of the plant was removed and samples of roots tissues were collected 
before and at various times post-topping. As shown in Figure 9, ADC message abundance increased 
in the roots of topped Burley 21 plants during the 24 hr period after topping. Low alkaloid (LA) 
mutants of Burley 21 show a much lower level of ADC expression in their roots, and no induction of 

25 ADC transcript accumulation after topping. The lack of ADC induction in the low-alkaloid mutant is 
consistent with previous studies (Hibi et al, 1994; Riechers and Timko, 1999; Wang, 1999) showing 
a general inability to activate gene expression leading to increased polyamine formation and alkaloid 
biosynthesis as a result of the mutation of the Nicl and Nic2 regulatory genes. 

30 NtADC-2 is predominately expressed in roots of wild-type plants. 

Due to the high degree of identity between the NtADC-1 and NtADC-2 transcripts (e.g., 95.8% 
coding regions, 94.4% and 96.4% in 5'- and 3'-UTRs, respectively), it is impossible to distinguish 
between the two transcripts by RNA gel bot analysis. Therefore, we employed a RT-PCR based 
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strategy and gene specific oligonucleotide primers. Total UNA was extracted from tobacco roots, 
leaves and flowers, and single-stranded cDNA synthesized using an oligonucleotide primer capable 
of hybridizing to both ADC1 and ADC2 transcripts. As an internal control for amplification, a gene 
specific primer recognizing the atp2 transcript encoding the P-subunit of the tobacco mitochondrial 
ATPase was include in the reactions. Under experimental conditions providing amplification in the 
linear range of the PCR reaction, gene specific forward and reverse primers were used to specifically 
amplify either ADC1 or ADC2 cDNAs. Test reactions (Figure 10A) using plasmid DNA encoding 
NtADCl or NtADC2 as template demonstrated the specificity of the primers. As shown in Figure 
10B, the main transcripts detectable in all tissues tested are derived from NtADC-2. Flowers express 
the highest level of ADC, and leaves lowest. In the flowers, although ADC1 is detectable, far less 
than ADC2 Roots also express a significant level of ADC. 

ADC transcript levels are highest in the roots and floral organs, and low in other plant tissues. 
The two ADC genes investigated appear to have different modes of regulation, with ADC2 being 
predominately expressed in the roots and other organs. 

At the present time, only limited information is available on the nature of regulatory regions 
in the promoters of genes encoding enzymes of alkaloid biosynthesis. The availability of cloned 
genomic fragments encoding ADC allows one to begin mapping regulatory sequences within 
members of these genes responsible for tissue specific, developmental, and inducible expression. 



III. ODC 

The present invention features the genes of two members of the nuclear gene family encoding 
ODC in tobacco N. tabacum. As the following experimental discussion shows, the ODC-2 gene is 
preferentially expressed in roots and floral tissues. Furthermore, the abundance of ODC transcripts in 
root tissues is affected by topping. Furthermore, analysis of ODC transcript levels in roots of low 
and high nicotine producing lines shows that ODC expression is under the control of the Nicl Nic2 
regulatory lqci 
Materials and methods 

Plant growth and tissue preparation 

Seeds of N. tabacum cv. Xanthi, wild-type and low alkaloid nicl nic2 mutant N. tabacum cv. Burley 
21 were obtained from Dr. G. Collins (University of Kentucky, Lexington). Tobacco plants used for 
DNA isolation were grown in soil:vermiculite mixture in the greenhouse under natural lighting 
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conditions. Plants used for RNA extraction were grown either in Moltan Plus (Moltan Co., 
Middleton, TN) or hydroponically in a dilute (half-strength) Peters nutrient solution with continuous 
aeration of the roots under natural lighting conditions in the greenhouse. Topping experiments were 
conducted by removing the floral meristem, leaves and stem (approximately the upper 1/3 of the 
5 plant) from tobacco plants just prior to blooming. Floral parts and other plant tissues were collected 
from fully matured individuals, frozen in liquid nitrogen, and stored at -80 °C until used for RNA 
preparation (see below). 

Screening of genomic libraries and phage characterization 

A genomic library constructed in EMBL3 from N. tabacum cv. Xanthi leaf DNA (Clonetech, Inc., 
Palo Alto, CA) was screened by plaque hybridization (Sambrook et al % 1989) using a 32 P- 
radiolabeled, 1.6 kb EcdBl-Xhol insert from plasmid PR46 as probe. PR46 encodes a full length 
ODC cDNA previously isolated by differential screening of plasmid libraries prepared from mRNA 
isolated from the roots of wild-type Burley 21 plants before and 3-days post-topping (Wang, J., 
Sheehan, M, Bookman, H, and Timko, M.P., unpublished data). Hybridization was performed at 
65 °C for 16 h in a solution containing 0.25 M Na^O,, (pH 7.2) and 7% (w/v) SDS. Following 
hybridization, the membranes were washed twice in 2 x SSC, 0.1% SDS for 15 min at room 
temperature, once in 0.2 x SSC, 0.1% SDS for 30 min at 65 °C. Hybridizing phage were picked and 
plaque purified through three subsequent rounds of hybridization screening. Phage DNA was isolated 
from plaque purified phage using a Qiagen Phage Midi Preparation Kit (Qiagen USA, Valencia, CA) 
and insert DNA characterized by restriction mapping and DNA gel blot analysis. The relevant 
hybridizing bands in each phage were cloned into pBluescript SK+ vectors for further analysis. 

Nucleic acid sequencing and analysis 
25 Nucleotide sequencing was carried out manually using the Sequenase Version 2,0 protocols 

according to the manufacturer's protocol (United States Biochemical, Cleveland, OH) or with an ABI 
310 Genetic Analyzer (PE Applied Biosystems, Foster City, CA) using double-stranded plasmid 
DNA templates prepared utilizing the Qiaprep Spin Plasmid Kit (Qiagen USA, Valencia, CA). The 
nucleotide and predicted amino acid sequences of the various cDNAs were analyzed using BLAST 
30 sequence analysis programs (Altschul et al., 1990; Gish and States, 1993) and protein sequence 
alignments were carried out using the PILEUP program (Genetics Computer Group Sequence 
Analysis package, Version 9.0 (GCG, University of Wisconsin, Madison, WI) and the various gene 
sequences available in the NCBI (National Center for Biotechnology Information, Bethesda, MD) 
nucleotide and protein sequence database. Manual adjustment of the sequence alignments were 
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carried out as necessary. 

RNA isolation and gel blot analysis 

Total RNA was extracted from tobacco roots, leaves, and floral parts using Tri-Reagent 
5 (Molecular Research Center, USA, Cincinnati, OH) according to the manufacturer's protocol. For 
RNA gel blot analysis, aliquots (10 \ig) of total RNA extracted from the various tissues were 
fractionated by electrophoresis through a 1.2% agarose-formaldehyde gel and blotted onto Nytran 
nylon membranes (Schleicher & Schuell, Keene, NH) using 10 X SSC. The transferred RNA was 
UV cross-linked to the membrane using a UV Stratalinker (Stratagene, La Jolla, CA) and the 

10 membranes were prehybridized in 7% SDS, 0.25 M Na 2 HP0 4) pH 7.2 for 2-4 hours at 65 °C. 

Hybridization was carried out in the same buffer in the presence of 32 P-labeled probes for 16 hr at 
65 °C. The membranes were washed under high stringency conditions and subject to 
autoradiography at - 80° C for approximately 48 h. 

Restriction fragments derived from cDNA clones of interest were separated by agarose gel 

15 electrophoresis, the DNA was purified, and quantified by spectrophotometry. [ 32 P]-dCTP -labeled 
probes were prepared from 25-50 ng of insert DNA by random primed labeling (Random Primed 
Labeling Kit, Boehringer Mannheim, Indianapolis, IN). As a control, the blots were also probed with 
radioactively labeled probes encoding the alkaloid biosynthesis enzyme putrescine N- 
methyltransferase (PMT) (Riechers and Timko, 1999), a root specific, topping inducible P- 

20 glucosidase encoding cDNA (TBG-1) (Riechers, D.E. and Timko, M.P., unpublished data), 26S 
rRNA (PR31) or 28S rRNA fragments. 

Genomic DNA isolation and gel blot analysis 

Tobacco genomic DNA was prepared from tobacco leaf tissue by the method of Junghans and 

25 Metzlaff (1990). Total genomic DNA (15 ng) was digested to completion with EcoRl or Hindlll, the 
digestion products were fractionated by electrophoresis through a 0.8% (w/v) agarose gel, and 
transferred onto Nytran nylon membrane (Schleicher & Schuell, Keene, NH) in the presence of 0.4 N 
NaOH (Sambrook et ai, 1989). Following transfer, the membrane was rinsed in 2 X SSC , the DNA 
was UV cross-linked to the membrane, and the membrane was prehybridized and hybridized as 

30 described above. Following hybridization and washing, the membranes were subjected to 
autoradiography at -80 ° C. 

Results and discussion 
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Gel blot analysis of tobacco genomic DNA cut with various restriction enzymes and 
hybridized with an [a- 32 P]-dCTP-labeled 1.6 kb EcoRI-XhoI cDNA fragment (PR46) encoding the 
full-length ODC protein from N. tabacum cv Burley 21 (Wang, J., Sheehan, M., Bookman, H. and 
Timko, M.P., unpublished data) indicated ODC is encoded by small gene family in the N. tabacum 
5 genome (Fig. 1 1). Four to five major bands and several minor bands of sufficient size to encode full- 
length genes are detected in either EcoRI or Hindlll digested tobacco DNA. 

To further analyze the structure and regulation of members of the ODC gene family in 
tobacco, a X EMBL3 phage genomic library constructed with DNA from N, tabacum cv Xanthi was 
screened using a [a- 32 P]-labeled probes prepared from PR46 (as described above). From a screen of 
10 approximately 3 X10 5 phage, five hybridizing phage were recovered, of which three were fully 
characterized by restriction mapping and DNA gel blot analysis. Two phage proved to contain 
identical insert DNA and the third had a unique restriction digestion profile. Following DNA gel blot 
analysis, the hybridizing fragments were cloned into pBluescript and their nucleotide sequence 
determined. 

15 The complete NtODC-2 gene spans two Sail fragments of 2.7 kb and 6.5 kb. The coding 

region of the gene contains a singlel302 bp open reading frame uninterupted by introns (Fig. 12). 
The nucleotide sequences of NtoDC-2 is identical within the coding and 5 T and 3'- untranslated 
regions to the PR46 encoded cDNA, with the exception of four nucleotide changes (residues +2, +4, 
+6 and +8) in the 5'-untranslated region. These nucleotide differences likely reflect changes 

20 introduced during the cDNA synthesis reaction. 

The predicted amino acid sequence for the NtODC-2 encoded protein (designated pODC2) 
(Fig. 13) is identical to the ODC characterized from Burley 21 tobacco encoded by PR46 (Wang, J., 
Sheehan, M. 5 Bookman, H. and Timko, M.P., unpublished data) and to the partial N, tabacum ODC 
cDNA sequence (PR17) reported by Malik et ai t (1996). Comparison of the predicted amino acid 

25 sequence for pODC2 with the ODC proteins characterized from two different tobacco cultivars 

showed that the pODC2 differs by 7 amino acid (98% identity) from the ODC protein characterized 
from the high alkaloid cultivar, N, tabacum cv. SC58 [Genbank Accession No. Y 10472.1] and by 7 
amino acid (98% identity) from ODC protein from BY-2 cells. The tobacco pODC2 is 89% and 
90% identical to the ODCs from tomato (Lycopersicon esculentum) and jimsonweed (Datura 

30 stramonium), respectively, but substantially less similar to ODCs from yeast (35% identity) and 
humans (32% identity). 

The NtODC-1 gene, contained on an 4.0 kb Xbal fragment, encodes a single open reading 
frame of 141 amino acids encompassing the amino terminal one-half of ODC (Fig. 12). Six amino 
acid residue changes distinguish the NtODC-2 and NtODC-1 encoded proteins over the homologous 
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region of the proteins. Beginning at amino acid residue 130, the NtODC-1 encoded protein (pODCl) 
diverges from pODC2, with a stop codon present after residue 141. Scanning the available 
nucleotide sequence (> 1 kb) in the 3'-flanking region of the NtODC-1 gene failed to reveal any 
evidence for ODC homologous protein sequences in any of the three translational reading frames. 
5 Interestingly, a comparison of the 5'-flanking sequence of the NtODC-1 and NtODC-2 genes 
revealed that while the NtODC-2 gene has a clearly recognizable TATA-box properly located at 
approximately -35 bp from the transcriptional start site, no such regulatory motif is found in the 
NtODC-1 gene sequence. Consistent with this observation, RNA gel blot analysis performed using a 
hybridization probe prepared from NtOCD-1 immediately downstream of the frame shift, failed to 

10 detect any message in various tissues of mature tobacco plants (data not shown). Thus, it appears that 
NtODC-2 represents an unexpressed pseudogene in the N, tabacum genome. 

To determine the spatial pattern of expression of the NtODC-2 gene, gel blot analysis was 
carried out using total RNA prepared from roots, stems, young and mature leaves, and various floral 
parts of Burley 21 tobacco plants. As shown in Fig 14, transcripts encoding ODC were easily 

15 detected in the roots, with little or no expression in other tissues except sepals, carpels, and mature 
stamens. 

The formation of nicotine and total leaf alkaloids in tobacco is known to be under the control 
of at least two independent genetic loci (Legg et aL, 1969; Legg and Collins, 1971), designated Nicl 
and Nic2 (Hibi et aL, 1994), Nicl and Nic2 are semidominant and operate synergistically to control 

20 plant alkaloid content, with mutations within these genes resulting in plants with reduced levels of 
nicotine and total leaf alkaloids (wild-type > nicl > nic2 > nicl nic2) (Legg et aL, 1969; Legg and 
Collins, 1971), Although no information is available on the nature of their encoded products, it has 
been speculated that Nicl and Nic2 likely encode transcriptional regulators capable of globally 
interacting with a subset of genes encoding components of polyamine and alkaloid biosynthesis 

25 (Hibi et aL, 1994). Removal of the flower head and several young leaves (i.e., topping) leads to 

activation of nicotine formation in the roots of decapitated plants (Akehurst, 1981; Hibi et aL , 1 994). 
To determine the effects of topping on NtODC-1 expression in roots, Burley 21 plants were grown in 
the greenhouse to the bud stage at which point the upper 1/3 of the plant was removed and samples 
of roots tissues were collected before and at various times post-topping. As shown in Fig 14B, low 

30 levels of the ODC transcripts were found in roots prior to topping and message abundance increased 
approximately 2-fold in the roots of topped Burley 21 plants 4 hr after topping. By 24 hr after 
topping, ODC transcript levels return to their initial levels. Low alkaloid mutants of Burley 21 
subjected to the same treatment show a much lower level of stimulation of ODC transcript 
accumulation after topping, and the enhanced transcript abundance does not persist beyond 4 hr. By 
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comparison, transcripts encoding PMT and and a tobacco root-specific P-glucosidase (TBG-1) show 
patterns of accumulation similar to that observed for ODC transcripts in wild-type plants, but no 
induction in the low-alkaloid mutant, consistent with previous studies (Hibi et ai t 1994; Riechers 
andTimko, 1999; Wang, 1999). 

5 

IV. SAMS 

A single recombinant phage is identified as encoding for SAMS. This X phage contains an 
approximately 15kB Sail insert. Restriction mapping and PCR analysis indicates that the insert 
DNA contains primarily the coding and 3 'non-coding portions of the SAMS gene. The nucleotide 
10 sequences for the gene encoding SAMS can be found at GenBank Accession Nos. AP27243 (full 
length SAMS cDNA). 

V. NADH dehydrogenase 

A fragment of the cDNA encoding for NADH dehydrogenase in N. tabacuum shows high 
1 5 expression in the roots of mature wild-type HP plants compared to low alkaloid mutant LP plants. 

VL Phosphoribosylanthranilite isomerase (PAI) 

The gene encoding for a fragment of phosphoribosylanthranilite isomerase in N. tabacuum is 
a homolog of the Arabidopsis thaliana gene encoding PAI, an enzyme involved in tryptophan 
20 biosynthesis. This eixzyme is involved in the overall formation of aromatic compounds in plants. 
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What is claimed is: 

I . An isolated DNA molecule comprising the nucleotide sequence of (SEQ. ID. NO. 2), (SEQ. ID. 
NO. 5), (SEQ. ID. NO. 8), (SEQ. ID. NO. 1 1), (SEQ. ID. NO. 13), (SEQ. ID. NO. 15), (SEQ. ID, 

5 NO. 18), (SEQ. ID. NO. 21), (SEQ. ID. NO. 23), (SEQ. ID. NO. 25) or (SEQ. ID. NO. 26) or 

comprising a nucleotide sequence encoding the amino acid sequence encoded by (SEQ ID NO. 3), 
(SEQ. ID. NO. 6), (SEQ ID. NO. 9), (SEQ. ID. NO. 12), (SEQ. ID. NO. 14), (SEQ. ID. NO. 16), 
(SEQ. ID, NO. 19), (SEQ. ID. NO. 22) OR (SEQ. ID. NO. 24). 

10 2. A vector comprising the isolated DNA molecule of claim 1 operably linked to sequences capable 
of directing the transcription of a mRNA encoded by said isolated DNA molecule. 

3. An isolated DNA molecule comprising a DNA sequence complementary to the nucleotide 
sequence of claim 1. 

15 

4. A vector comprising the isolated DNA molecule of claim 3 operably linked to sequences capable 
of directing the transcription of a mRNA encoded by said isolated DNA molecule. 

5. A cultured transgenic tobacco cell stably transformed with the vector of claim 2. 

20 

6. A cultured transgenic tobacco cell stably transformed with the vector of claim 4. 

7. A transgenic tobacco plant stably transformed with the vector of claim 2. 

25 8. A transgenic tobacco plant stably transformed with the vector of claim 4. 

9. The isolated DNA molecule of claim 1, wherein the isolated DNA molecule comprises the 
nucleotide sequence of (SEQ ID NO:). 

30 10, A vector comprising the isolated DNA molecule of claim 9 operably linked to sequences capable 
of directing the transcription of a mRNA encoded by said isolated DNA molecule. 

I I. An isolated DNA molecule comprising a DNA sequence complementary to the nucleotide 
sequence of the isolated DNA molecule of claim 9. 
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12. An isolated DNA sequence comprising about a fifteen to about a twenty-five base pair 
oligonucleotide sequence identical to any consecutive about fifteen to about twenty-five base pair 
sequence found in (SEQ. ID. NO. 2), (SEQ. ID. NO. 5), (SEQ. ID. NO. 8), (SEQ. ID. NO. 1 1), 
(SEQ. ID. NO. 13), (SEQ. ID. NO. 15), (SEQ. ID. NO. 18), (SEQ. ID. NO. 21), (SEQ. ID. NO. 23), 

5 (SEQ. ID. NO. 25) or (SEQ. ID. NO. 26). 

13. A cultured transgenic tobacco cell stably transformed with the vector of claim 10. 

14. A transgenic tobacco plant stably transformed with the vector of claim 10. 

.10 

15. A vector comprising a DNA sequence which encodes an antisense mRNA which is 
complementary to a fragment of a mRNA encoded by the isolated DNA molecule of claim 1, 
wherein said sequence is operably linked to sequences capable of directing the transcription of said 
antisense mRNA in tobacco cells and wherein the expression of said antisense mRNA in tobacco 

1 5 cells is sufficient to provide for reduced nicotine content in tobacco cells which are stably 
transformed with said vector as compared to untransformed tobacco cells. 

16. A cultured transgenic tobacco cell stably transformed with the vector of claim 15. 

20 17, An isolated and purified protein comprising the amino acid sequence identified in (SEQ ID NO. 
3), (SEQ. ID. NO. 6), (SEQ ID. NO. 9), (SEQ. ID. NO. 12), (SEQ. ID. NO. 14), (SEQ. ID. NO. 16), 
(SEQ. ED. NO. 19), (SEQ. ED. NO. 22) or (SEQ. ID. NO. 24). 

1 8. A method for regulating gene expression in a plant comprising functionally linking an alkaloid 
25 gene promoter to a nucleic acid encoding a protein, wherein the promoter comprises a nucleic acid 

sequence selected from the group consisting of the sequences identified in (SEQ ID NO. 1), (SEQ. 
ID. NO. 4), (SEQ ID. NO. 7), (SEQ. ED. NO. 10), (SEQ. ID. NO. 17),and (SEQ. ID. NO. 20). 

19. The method of claim 18, wherein the nucleic acid encoding a protein encodes a protein involved 
30 in the biosynthesis of alkaloids in plants. 

20. A plant transformed by the method of claim 18. 
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tataataaca tcagtgaagc agaattgaaa gccatcaagt atgggtgtga atggtgcaaa tacaaaggaa tatcaaactt -709 
ctgaggacag tagtaaaaac aatatcagac aaaaagtaaa gttgtattat ttagcttgag gataaagtat gtcattagtt -719 



NtPMT3 
NtPMT2 
NtPMT4 
NtPMTla 



cattgtggaa actgactcga ggatgatcta tgacatacta cagaccaaaa gtcfaagcaa caacaagttg aaacaagaga -629 
ttgtgagaga tttggtgtcc tctacaatga ttgttgaagt ccctatttat fgcfatacac aggaaacaaa atcctaggat -639 

_ g §ati$caatgg agaaggaaaa tatttccagt -625 



NtPMT3 ccgagaaatt aatggagatt ctggac a?c_"-gcT9g acacc tgttacccat tgccttcgcg aagcajgstca aggggcagac -54 9 

NtPMT2 caagcccctc ttaaatgaca ataatggggt taatgatgaa tatgtagcgg catgacatga atgcc^aat tcfeccgcaac -559 

NtPMT4 ——————— —-————_— — __________ __________ ______ 

NtPMTla gtaaacacaa gtgaatgaag agaagccaaa ataatctcta tcattcaagc cttaggtgga gatta|$aaa atgatttact -54 5 



NtPMT- 
NtPMT2 
NtPMT 4 
NtPMTla 

NtPMT3 
NtPMT2 
NtPMT4 
NtPMTla 

NtPMT 3 
NtPMT2 
NtPMT4 
NtPMTla 

NtPMT 3 
NtPMT2 
NtPMT 4 
NtPMTla 

NtPMT3 
NtPMT2 
NtPMT 4 
NtPMTla 



PAL 

- - v----. -. r — a _ -LL.agacag gtatcaaaag cggccaaggg -469 

gactatttat ttaftattga ggaatatttt ttattagat$ (ttatctggtg acaagcattc gtttgcttcc gttgattacg -479 

ttcttatcaa agtgataggt gatcaacagc tttcgt|aa| fgtcattagg agaatattat aatctctttt atgctgaaga -465 



tggtttgcta aag|ggccac cagagctaac gaaggt|tc| gtcat BeSgT 

_ .... .... c^g 



ccctttc 
ttgattt 



atggatatg| ggcaggtccc ttattttaga attaga^atg aaaaaEctaa tttttttttg 
^ gatctactcjj ataccaaccg aagccgttgt ccttga|ctt cgcttfccatt taattcatct 

acccaca|aa ggaagatca| aaaatacatg actttcagat gacttcftgg agctt|attt ttaaagagtg 

ctgtg|atag 
tccgggttca 
tctg||tcca 
cagcaaagag 



taagttaatt -389 
399 
389 
385 



tccgt^tgcc 
— gtggacc 
gctag&tggt 



tg^gagfgaaa tcgtctaatS tgta|$fet||i ccc§tagSc^ 
caaig^a^gc acccattc§g tta^^a^ gaa^cca'"'"'""' 
"fcg^algc acccattc^aj tta£|pa|_i ga|$cca 
gtgciggica gatatcatgg aat^Ucga tt#tttgt£ 



giatg.gtaaggt tgatsttccc 
f£|t3ct|ta c|t|j|t|t| 

Sqiaaaglaa gaggasggga 




NtPMT3 |^gPta| fgf ' — - S&§P£P i 

NtPMT? $£^t*^| Isaaatcgta atacttgttg B^aii^i§ 

NtPMT 4 |^t$£$g£ p&aatcata atacttgttg 

NtPMTla ______sb_-*- 
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WO 00/67558 



PCT/USOO/12450 



6/21 





WO 00/67558 PCT/USOO/12450 



7 , 

NtADCl 

ttcacgttctcttctcaattcccataaaagaaaccctt 
gtttccgtcctatttt—ctcttcttctacgcttc 78 
NtADC2 



tc. . .a c 

. .t. . . 80 

NtADCl 

ctcttctgatatcaatatctgtatggtgtttttcttg 
ttcgaattttagatttgttttgcctttaatacctgta 
acctta 158 
NtADC2 

a 

t a. . 

160 

NtADCl 

taattctctgtttaaaccaaaaacttagcttcttctg 
aagtcagggtggggatatttggatcgtgtaagagtgt 
gttaga 238 
NtADC2 - 



t 

239 

NtADCl 

aggtgattatcttttgattcagttccttttttgcttc 
ttttgagggggtagccggggcctcggcctcggcgggt 
tttaat 318 
NtADC2 

g 



21 



grta'cT 319 
NtADCl 

agcccccatctattacaaccattgggcaaaaacatca 
ttaaatctgtacaaagcaaacccttaatttagtttaa 
ttttct 398 
NtADC2 

t 

a 

399 

1 

MPALGCCVDAT- 
V S P P 
NtADCl 

gtattctttgattctttaacagaagaagaagagATGC 

CGGCCCTAGGTTGTTG CGTAGACGCTACT 

GTTTCCCCTCC 475 
NtADC2 

a t ATG. 

T T . , . G . . GTT 

479 

1 

M*********AV 
* * * * 

16LGYAFSRDSS 
LPAPEFFTSGVPP 
T N S A 
NtADCl 

CTCGGCTATGCCTTCTCTCGGGATAGCTCTCTTCCCG 
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CGCCGGAGTTCTTTACCTCCGGCGTACCTCCTACAAA 

CTCCG 555 

NtADC2 

. . .A 

G G 

. . .T. 559 

17 *S******** 

*******A***** 

* * * * 

43 AGSIGSPDLS 
SALYGVDGW GAPY 
F S V 

NtADCl 

CCGCCGGTTCCATTGGGTCTCCGGATCTGTCCTCTGC 
TTTGTACGGGGTCGATGGGTGGGGAGCTCCTTATTTC 
TCCGTT 63 5 
NtADC2 

T.C t. . . .G 

. . .A 

. .T. . . 639 

44 *A******** 
************* 

* * * 

69NSNGD I SVRP 
HGTDTLPHQE IDL 
L K V V 
NtADCl 

AACTCTAACGGAGATATCTC CGTC CGACCACATGGT A 
CGGACACACTCCCCCACCAGGAAATTGACCTTCTCAA 
GGTCGT 715 



NtADC2 

T c 

T T 

719 

70********** 
************* 
* * * * 

96K KASD PKNSG 
GLGLQLPLVVRFP 
D V L K 
NtADCl 

GAAAAAGGCCTCCGACCCGAAAAATTCAGGGGGGCTC 
GGGCTTCAGCTGCCTCTTGTTGTTCGCTTCCCTGATG 
TGCTAA 795 
NtADC2 

T T 



. .T.G. 799 

97********** 
************* 
* * * * 

123 NRLESLQSAF 
DLAVHSQGYGAHY 
Q G V 
NtADCl 

AAAACCGGTTGGAATCTCTGCAATCGGCTTTTGATCT 
CGCTGTTCATTCCCAGGGCTATGGGGCCCACTACCAA 
GGTGTT 875 
NtADC2 



WO 00/67558 

9 / 

. . .G 

879 

124 ********** 
************* 

* * * 

149 YPVKCNQDRF 
VVBDIVKFGSSFR 
F G L E 

NtADCl 

TATCCCGTGAAATGCAATCAAGACAGGTTCGTGGTGG 
AAGATATTGTCAAATTCGGGTCGTCATTCCGGTTCGG 
GTTGGA 955 
NtADC2 

C c 

959 

150 ********** 
**********p** 

* * * * 

176 AGSKPELLLA 
MSCLCRGSAEGLL 
V C N G 
NtADCl 

AGCTGGGTCTAAACCCGAGCTCCTGTTAGCCATGAGC 
TGTCTCTGCAGGGGCAGTGCTGAGGGCCTTCTCGTTT 
GCAATG 1035 
NtADC2 

. . .C 

A 

1039 



PCT/USOO/12450 

21 

177 ********** 
*****K******* 

* * * * 

203 FKDAEYI SLA 

LVARKLMLNTVIV 

L E Q 

NtADCl 

GTTTCAAGGACGCTGAGTACATTTCGCTTGCTTTGGT 
TGCAAGAAAGCTCATGTTAAACACTGTAATTGTTCTT 
GAACAA 1115 
NtADC2 

G. . . 

1119 

204 ********** 
************* 

* * * 

229 EEELDLVIDI 
SRKMAVRPVIGLR 
A K L R 

NtADCl 

GAGGAGGAGCTTGACCTTGTGATTGATATAAGCCGTA 
AGATGGCTGTTCGGCCCGTAATTGGACTTCGGGCTAA 
GCTCAG 1195 
NtADC2 

A. . 

T 

1199 

230 ********** 
*H*********** 

* * * * 



tfcjujci (c) 
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256 TKHSGHFGST 
SGEKGKFGLTTTQ 
I V R V 
NtADCl 

GACCAAGCATTCAGGCCATTTTGGATCCACTTCTGGA 
GAAAAAGGTAAGTTTGGGCTTACAACGACCCAAATTG 
TTCGTG 1275 
NtADC2 



1279 

257 * * * * ****** 
************* 

* * * * 

283 VKKLEESGML 
DCLQLLHPHIGSQ 
IPS 

NtADCl 

TAGTGAAGAAGCTGGAAGAATCCGGAATGCTGGATTG 
CCTTCAGTTGCTGCATTTTCACATTGGATCTCAGATC 
CCTTCA 1355 
NtADC2 

.G A 

T 

T 1359 

284 ********** 
************* 

* * * 



309 TALLADGVGE 
AAQIYCEL IRLGA 
G M K F 

NtADCl 

ACGGCGTTGCTTGCTGATGGTGTTGGTGAGGCTGCTC 
AGATTTATTGTGAATTAATCCGTCTTGGTGCGGGTAT 
GAAGTT 1435 
NtADC2 

G A A C 

G A 

1439 

310 *G******** 
********V**** 
* * * * 



336 IDTGG GLGID 
YDGTKSCDSDVSV 
G Y G I 

NtADCl 

CATTGATACTGGAGGTGGGCTCGGAATTGATTATGAT 
GGTACTAAATCATGTGATTCAGATGTCTCTGTTGGCT 
ATGGCA 1515 
NtADC2 

T T 

C T 

1519 

337 * * I ****** * 
************* 
* * * * 
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363 QEYASTVVQA 
VQYVCDRKGVKHP 
VIC 

NtADCl 

TTCAAGAATACGCCTCCACAGTTGTCCAGGCGGTTCA 
ATATGTTTGCGACCGTAAGGGCGTGAAGCACCCAGTG 
ATTTGC 1595 
NtADC2 

T G T 

A T A 

. .C. . , 1599 

364 *****a**** 
************* 
* * * 

389 SESGRAIVSH 
HSILIFEAVSASS 
H S C S 
NtADCl 

AGCGAAAGTGGCAGGGCAATTGTTTCTCATCACTCAA 
TTCTGATTTTCGAAGCCGTGTCTGCTTCTAGTCACTC 
ATGTTC 1675 
NtADC2 



1679 

390 ********** 
************* 
* * * * 

416 SSHLSSGGLQ 
SMAETLNEDALA D 
Y R N L 



NtADCl 

TTCTTCACATCTGTCTTCTGGTGGCCTCCAATCCATG 
GCGGAGACGCTCAATGAAGATGCCCTTGCTGATTACC 
GCAATT 1755 
NtADC2 



C 

1759 

417 ********** 
************* 
* * * * 

443 SAAAVRGEYE 
TCVLYSDQLKQRC 
V D Q 
NtADCl 

TATCTGCTGCTGCAGTTCGTGGAGAGTACGAGACGTG 
TGTACTTTACTCTGATCAGTTGAAACAGAGATGTGTG 
GATCAG 1835 
NtADC2 

T A. . 



1839 

444 ********** 
************* 
* * * 

469 FKEGSLGIEH 
LAAVDS I CDFVSK 
A M G A 
NtADCl 

TTTAAAGAAGGGTCCTTGGGTATTGAACATCTTGCTG 
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CTGTTGATAGCATCTGTGATTTTGTATCAAAGGCTAT 

GGGGGC 1915 

NtADC2 



1919 

470 ********** 
************* 
* * * * 

496 ADPIRTYHVN 
L S I FT S I PDFWAF 
G Q L F 
NtADCl 

TGCTGATCCTATCCGCACTTACCATGTGAATCTGTCA 
ATTTTCACTTCAATTCCTGATTTTTGGGCCTTTGGTC 
AATTGT 1995 
NtADC2 

G. . 



1999 

497 ***v****** 
************* 
* * * * 

523 P IVPIHRLDE 
KPAVRGI LSDLTC 
DSD 
NtADCl 

TTCCGATTGTTCCAATACACCGTTTAGATGAAAAGCC 
TGCAGTAAGGGGAATATTATCGGACTTGACTTGTGAC 
AGTGAT 2075 



NtADC2 

T C 

G A 

2079 

524 ********** 
************* 
* * * 

549 GKVDKFIGGE 
SSLQLHELGSNGD 
G G G Y 
NtADCl 

GGGAAGGTTGATAAGTTCATTGGTGGCGAATCAAGCT 
TGCAGCTGCATGAATTGGGAAGTAATGGCGATGGTGG 
TGGGTA 2155 
NtADC2 



...C.A 

. . .T. . 2159 

550** * * * * * * * * 
***p********* 
* * * * 

576 YLGMFLGGAY 
EEALGGLHNLFGG 
P S V V 
NtADCl 

TTATCTGGGGATGTTTTTGGGTGGGGCTTATGAGGAG 
GCGCTCGGAGGACTCCACAACCTGTTTGGTGGACCAA 
GCGTGG 2235 
NtADC2 
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.T..C. 2239 

577 ********** 
************* 

* * * * 

603 RVVQSDSAHS 
FAMSRSVPGPSCA 
D V L 

NtADCl 

TGCGCGTGGTGCAGAGCGATAGCGCTCACAGCTTCGC 
CATGTCTCGCTCCGTCCCTGGCCCGTCCTGCGCGGAC 
GTGCTC 2315 
NtADC2 

T. . 

... .A X T.T . 

2319 

604 ******** ** 
***T********* 

* * * 

629 RAMQHEPELM 
FETLKHRAEEFLE 
Q E E D 
NtADCl 

CGAGCGATGCAGCACGAGCCCGAGCTCATGTTCGAGA 
CTCTCAAGCACCGTGCGGAGGAATTCTTGGAACAAGA 
AGAAGA 2395 
NtADC2 



. . .T. . 2399 



630 ********** 
************* 
* * D * 

656 KGLAIASLAS 
S LAQS FHNMPYLV 
A P A S 
NtADCl 

CAAAGGGCTGGCCATTGCATCTTTGGCCAGCAGCTTA 
GCTCAGTCCTTCCATAACATGCCTTACCTTGTGGCGC 
CTGCAT 2475 
NtADC2 

- TG. . .A g. . 



. .T. . . 2479 

657 ****-VE****. 
*V*********** 

* * S * 

683 CCFTAVTANN 
GGYNYYYSDENAA 
D S A 

NtADCl 

CTTGCTGCTTCACTGCAGTTACTGCTAACAACGGTGG 
CTATAACTACTATTACAGTGATGAGAATGCAGCAGAT 
TCTGCT 2555 
NtADC2 

C T.C A T 

T 

2559 

684 *R***A*D** 
************* 

* * * 
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709 TGED EIWSYC 
T A *** 

NtADCl 

ACAGGGGAGGATGAGATTTGGTCCTATTGCACTGCTT 

GAagtgttgtcgtagcatctccagttttagtttgtcg 
tcgaag 2635 

NtADC2 



g. 2639 

710 ******** 
* * * * *★* 

720 
NtADCl 

ttgtctgtttttgaataatacccttagttggtgatgt 
ttttct 



T 



GA 



c 



2678 



NtADC2 



aataata 



2682 
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M. aylvaatrla 



A. MtlW 

I. coil 



w«a«M*w(KVMH--nKMWUMaiH4)aniWTUIXaiUJIDT«Jn, 442 

ATvn«>vuraAvs3rT-TK~-aoiu«viujosrvuijanuujumuik 443 

AIVaiIK»UriAVS9TT-TA~«0KUIVIU«srvuuaWMimuq, 443 

AXvamsvurtAviiuK-fii— vHOkrKDzgruxc- 

434 



ATV8HH9V LX FEAVQTS*- TW*4QGM f AUNflT UUUI8 TOTL 

nn wuim ti hi ■■■ — ■■11 1 



437 



AVTMonvLvnazwnitiimrvTAiMOuiuu, -gmwnvQOi 



A. nUt« 




US 
JSC 
ISC 



L. CSOttlCBtXai 



L. (KUlntui 



•XCD 41 

■ *M i j m rPTcvtYi mQ iJco ic vi-cn3)ocuiirowkvi>— -^ito 




rVlXAXOMDIVfcT-" 



mtuimxrcmHraoLr ri 
~ thvioi xrnv runoasgun 



WUBUflMW 
nfXVUVrTlXTBUHOUOU'PI 



— •YHvaLnrTavroruaooirrx 



■uzrravfmygnavrn 
TuvrsufrDTwuflKLrm 



vrzHUJiiuvvtaxLSDLTCDsixiiasiinocu iulhsuunob 

vriHiXBUirw»xuDLTCB»aK2Dxrxg<»« — skruauuNoa 

VfXH»U«QMSMaXUOt.TCM06«I1IKIieeil 

ILFUBIMSKM »W 

Krvs*uii»TKuTiOTirrcBSDtucviiicrxOT 
ULxei^ruuvuutOBaaiiHnDcoauTTiiiiinTBfnr 



524 
427 



11* 
444 
SOT 



S74 
»74 



III 

101 
3M 



CK^----TTU«WUimTIXAlMU0fUaOP8VVItVV03OIAII» 
COCDCOJ1-TT USKT LOGAYXXAXGOUOOrOQ rWV LAVBOSU3 IW3 f AYT 

oaaoocK-TTUSKriawTt^Ax^uotLxoorsvLiivsonisrKarAVT 

o»-TTUQiruwTi^jufla\otviJ«wivv»v» oa i» iw AVT toi 
•~n»kmiMmniAWSvwiuwmvivBonirnTm 

G --TTVAVU.TGATOEALAMCH* U1UPSLVRWCTG 



■II 
•23 



A. Mtlv* 




rrxau«nanu 



■ngxvKwuujoMH 




L. •MalUtlB 



H. vylvwatrl* 
L. •aculantu» 
A. tt»ll*t* 



uvMPscjuTvuuuoanLKrtrucxAAXxrL — 
CAvivrscAxvLAAitaranuffrruoiAAxtTVH... 
CAvnxncAwuuuoaKuinTijeauuCTVH-.-. 



mo 

MVMIMDVIAVI«KnUVXTUDCk>G«yv-< 



-AAUABTTULXaTVtTO— 



-VxgDXisvXIXAAM 

uxsainvAiM«r^L&rKTU.rcTiLOovx -urDLwuujQQ — 



p -nanrriVArAscCTTAVTAK 



!^A«IAUtAOI-naaCITLVTiaSCCLTA.AAMMeCT-TTOISIIITVG 
LATASLWLW-n«*OTLVTWMCCLTA-»A»D^ 



«4t 

971 
♦41 



704 
715 
713 



721 
734 
733 



H. aylvaatxla 
L. •«cul*atta* 
A. tbilUM 



IPTPOTIt3CI»5ia/ JW i miUl !MTVVpA» v rV I.UA J«»TOPVXC«»Ofc 3t3 

iDTixiTitari>xtxrwTCLciTAjTvvwv>r»cnAj(wnir/ic»i#c» ata 

«L0^TASTVVgAVUVC9M0(V«K»VXC«U«>. 3(1 



A. MftlT* 



ximicncscDnivvvTOurrAAAvvmvoevem-avDtH 



VPTXOTAJ-Qfl^*V>rrCLitlTA»firMAIBOACtPCLWOTVIT»»CA 



313 
347 
313 
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LA Mutant 



Wild-typo 




12 24 hr after topping 

AOC 

p-ATPase 



12 24 hr after topping 
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pOOCl 



ODC2 CCTACCCCTT CAACAGCTAT TTCTCTAAAA AAAAAAAAAA AAGAAGAAAA TACTACGTAG ATTACACAAT ATTATCAGTA GTAGTATCAC U'MLUJUl 

ODC1 AAAGTGAGTT TCACCCAATA TGAGCGTGTG AAAAGCCCAA AAAAACACTT U1J M1 ATT TTTTTTTATT TCCTCCAAAA AACACATTTT AACGTATTTT 

-3 5 TATA box 6S 
ODC2 T OmagJU T GA7AAACATT TTTACAGGTT TCCCCtfTCTC AAfcffOTIjrfc JtfUfflUATAT TCATATTATT GAATCCCTAO 1 1 1L I I I 1L1 TTCCCTTTCA 
ODCI TAACCACATT TT GC1CLJ I I CCTTTCCGGC CTGGTATTCQ ATTCCTTTAA CAATOOCTTC AACCATOQTO GCAACC1ATT CTCGTTTTCT TTCCCTTTCA 

« 165 

ODC2 TTCCTTCCTC TCATTTACCT CTCTCTTTTC 1 1CL1 HU1 l TGGATGGCCG GCCAAACAAT CATCCTTTCC GGGTTGAACC COGCCCCCAT TCTTCACTCC 

ODCI TTCCTTCCTC TCATTTACCT CTATCTTTTC TTACTTTCTT TGGATGGTCG GCCAAACAAT CAT C C T T TCC GGGTTGAACC CAGCGGCCAT ILllLA CTqC 

P™ 02 MA GQTI I V S G L H P A A I LOS 

pODCl V c 

1M 265 

ODC2 ACAATTGGCG GCGGAGCTTC TCCTACAGCO GCGGCGGCGG CSGAAAACGQ CACCAGAAAA CTCATCCCTC TCTCAAGAGA TGCCTTACAA GATTTCATGT 

ODCI ACAATTGGCG GCGGAGCTTC TCCTACAGCG GCGGCGGCGG CGGAAAA CG A CA C CAGAAAA GTCATCCCTC TTTCAAGAGA TGCCCTACAA GATTTCATGT 

TIG GGAS P T A AAA A I H G T R X VIP LSRO A t Q DTK 

pODCl D 



266 



365 



ODC2 TATCAATCAT A fcCCCAAAAA TTACAAGATG AGAAACAACC TTTTTACGTG CTAGACTTGG GTGAGGTTGT TTCTCTTATG GACCAATGGA AATCTGCTCT 
ODCI TATCAATCAT AACTCAAAAA TTACAAGATG AGAAACAACC TTTTTACGTG CTAGATTTGG GTGAGGTTGT TTCTCTTATG GACCAATGGA AATATGCTCT 
POOC2 L 5 I I TQK L Q D IKQP F Y V L D L GIVV S L M DQW X S A L 

If 



366 



465 



ODC2 CCCAAATATC CGTCCATTTT ACGCTGTTAA ATGTAACCCT GAACCGTOST TCCTTTCAAT TTTATCTGCT ATGGGCTCAA ATTTTGATTG TGCTAGCCGA 

ODCI CCCAAATATC CGTCCATTTT ACGCTGTTAA ATGTAACCCT GAATCGTTGT TCCTTTCAAT TTTATCTGCT ATGGGCTCAA ATTTTGATTG TGCTAGCCGA 

PODC2 PHI RPT Y A V X CKP IPS fLSI LSA KGS NFDC A S R 

pODCl S L 



466 



565 



ODC2 GCTGAAATTG AGTAT GTTTT ATCTCTTGGC ATTTCACCTO ACCOTATTOf TTTCGCAAAT CCATOCAAAC CGGAATCCGA TATTATTTTT GCAGCAAAAG 

ODCI GCTGAAATTG AGTATGTTTT ATCTCTTGGC AA T AAQAAAA GASAOAGOTC AATOGGTTAC TTOATTTOAT GAAAGTTTCG GAAATTAATA TTGGGGTTGT 

PODC2 All I Y V L S L G ISP DRIV F A H PCK P£SD I I F AAX 

pODCl K X X R I R S MGT LI. 



566 



66S 



ODC2 TTGGGG TGAA TCTTACAACC TATCATTCTG AAGACGAGGT TTACAAGATC CGAAAGCATC ACCCGAAATC CGAACTCTTG CTCCGCATCA AGCCCATGCT " 

ODCI CTTCGTATCG TCATGGGAAT CTTTAGCTGA AGTTATAACA AATTT GG AGG AGTTTCTCTT AAAAATTTGG ATTAAAATCG TGCTTGGAAC AAGAACACAC 

POOC2 V G V H LTT YDS I D I V TKI R K H HPKS ILL LRI KPKL 



CGACGGCAAC GCGAGATGCC CAATGGGCCC GAAATACGGC GCGCTTCCAG AAGAAGTCGA CC C G C T GC T C CGGGCAGCTC AAGCCGCCCG TCTCACCGTA 
ATGAATAAAG CGAAGAACAC CAAGACCACT GATTTCCAAA ACACCAAATT TCAATTTTTT TAAACGTTTT LIUUfU.f TGGGTGTAAA TTAAGCTTTT 



ODCZ 

ODCI 

J>ODC2 DON ARC PMGP KYG ALP E I V 0 P L L A A A Q A A r"l T* 

7 « 665 

ODC2 TCCGGCGTCT CATTCC ACAT CGGTAG CCGA GATGCCGATT CAAACGCTTA TCTCGGCGCC ATAGCCGCGG CTAAGGAAGT GTTTGAAACA GCTGCTAAAC 

ODCI CTTTTCTTTT TTAGAATGTT ATTTTTATTT TATTTATTAA ATAGATTTAA CATAGTTTTT TTTACTCAAA ATAATATATG TCATTTTTTT ATTCGTCACT 

PGDC2 SGV SFHI GSG DAD SHAY LGA I A A A K I V TIT A A K 

"< 965 

ODC2 TCGGGATGTC GAAAATGACT GTTCTAGACG TCGGCGGCGG GTTTACATCC GGCCACCAGT TCACAACCGC CUJU-TCU X GTTAAATCAG CTTTAAAACA 
ODCI CGCCACGTCA GCAGCGAGTG CATTGCACAA ACTTTGTAAG TTTGGCTGAT TGTTAAATAA GTGCTCAATT GGAACAAAGT TCATGTAACG TATGTGCTCA 

PODC2 L G M 3 X H T VLD VGGG YTS G H Q F T T A A V A VKS ALKQ 

»« 1065 

ODC2 ACACTTCGAT GACGAACCGG AGTTGACAAT CATAGCTGAA CCGGGTCGGT TTTTTGCAGA GACGGCGTTT ACTTTGGCAA CGACGATTAT AGGGAAAAGA 

ODCI ATAGGAACTC TCTTAAGTTT AGGTGTCTAA ATGAAAGATC GTGCCAACTT TAAGTGTCTC CGTATGTATT CAGCCAAAAT AATGTAAGCC AAATGTAGTC 

PODC2 HTD DIP ELTI IAI PGR F F A I T . A F T LA TTII GKR 

ODC2 GTGAGGGGTG AATTGACGGA GTATTGGATT AACGACGGGC TGTACGGTTC CATGAACTGT GTACTTTACG ACOTGCGAC GGTGAATGCA ACGCCGTTAG 

ODCI AATAAAGCGA TCGTGCTAGA ACCACGGGAC TCAGGGAATG CCTTACACCT TCTCCCCGGT CAACAGAATT CCTTACTCGG AGTTTGTTTT CGAAGACCAA 

pODC2 VRG ELRE YWI NDG LYGS KMC VLY D H A T V H A TPL 



1166 



1265 



0DC2 CTGTTCTGTC GAATCGTAGT AACGTTACCT GCGGCGGGTC GAAAACGTTT CCGACGACTG TGTTTGGGCC CACTTCTCAT GCTCTTGATA CTGTTTTAAG 
ODCI TAATAATAGA GTGAAACCTT CCTTTGAATA GGGATTCAAA AAAAAGGTGA CTTGGAACAC CAGCAAAAAT TAATTCCTAG TGGCGACACT GTAAATAAAA 
PODC2 AVL5 HRS HVT CGGS RTF PTT V F G P TCD ALD TVLR 

1266 136S 

ODC2 GCATTACCAG TTACCGGAGC TGCAGGTTAA TGATTGGCTG GTTTTTCCTA ATATGGGTGC TTATACTAAA GCTGCTCCGT CCAATTTTAA TGGATTTAAT 

ODCI TAATCCCTAT TTCAAATTTG TCACTTTAAT TGGAAAAACT CTTTCACCCA CAATCCATAA CAACACATTA TCTTTTGGAG GTGTAAAAAG GTGATGTGAC 

P 00 " 0 T Q LPI LQVN DWL VF P NKGA YTK A A G SNTN G F N 

"« 1465 
ODC2 ACTTCCCCCA TTGTTACTCA CCTCGCTTAT TCTTATCCAA GCTGATGAAC CACCTGTATT AGGAATTACT ACCGTGGTTT TGATGGTTTT TTCCTTTTTT 
ODCI AGCTCTAGCA ACTCTGCTGG GGGCTATTAA TAAGAATTCG ACCTTTCTAT ATTGATTTTT ATTTGGCTTT TATCATGTCT TGGATATTAT TGTGTTTGCG 
PODC2 TSA IVTH LAV SYP S. 

l1€< Po ly A aign al 1565 

ODC2 GGGTATCTTT TTTTTAATTT TGTT I/1T1TT GGTAGTAATT TATATTCCAA ATCAGCTTGT AATTCTCTTG TATGCCf^^^^^rGCAAGG A TTT GC TAAT 

ODCI AGCATAATGT TCTATTTGTC TCTTATTTAT CGCTTTAATA GTTATTTAAA CTGTGATATA AATTGTATCC TATCTCGCAC CCCTCTGAGT C TTCTGATAG 

1566 ^Poly A iltc 1665 

ODC2 TCTGATTTTC TCTAA TATGG AAGTTTTTAA AATTAGTTTA AGAAACATAA TGGGTAAAAG GTTTGTGGGG TCATGATATT TCTGTGACTA TAAAAGCATC 
ODCI GTAGTTATGT TGTGTTTGCC TACCAGCATC ATAATATTTC TGTCTTGAGA TAAAGCCAGT TAGCCTACCA GCTTTTGGTG AAGGATTTAA TCACATATGT 
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1 

tabacum cv 
tabacum cv 
tabacum cv 
stramonium 
esculentum 
cerevisiae 
sapiens 




MSSTQVGNALSSSTTTLVD] 
MN|F — GN 



[etlhnlllelk|nqd 



N. 
N. 
N . 
D. 
*L. 
S. 
H . 



51 100 
tabacum cv Xanthi 
tabacum cv BY2 
tabacum cv SC58 
stramonium 

esculentum 

cerevisiae LELL jHEQAH PKI FQ^|KARI GR|jNMETCDPG§ENS §FI C^fiEfflKRflFN 

sapiens EEFDCH — FLDEGFTa-KBlLDQK^EVSSSBMD AHHA mBDILKKHL 




101 

tabacum cv Xanthi 
tabacum cv BY2 
tabacum cv SC58 
stramonium IIS^A* 
esculentum HflNSl 
cerevisiae NHVK1 
sapiens RgL: 



150 




151 

N. tabacum cv Xanthi 

N. tabacum cv BY2 

N. tabacum cv SC58 

D. stramonium 

L. esculentum 

S. cerevisiae 

H. sapiens 



200 




201 

N. tabacum cv Xanthi 

N. tabacum cv BY2 

N. tabacum cv SC58 

D. stramonium 

L. esculentum 

S . cerevisiae 

H. sapiens 



250 




251 

N. tabacum cv Xanthi 

N, tabacum cv BY2 

N. tabacum cv SC58 

D. stramonium 

L. esculentum 

S. cerevisiae SiFTSLY! 

H. sapiens TfPETFVi 



300 




301 

N. tabacum cv Xanthi • 
N. tabacum cv BY2 



350 
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N. tabacum cv SC58 

D. stramonium 

L. esculentum 

S . cerevisiae STAVL; 

H. sapiens 

351 

N. tabacum cv Xanthi 

N. tabacum cv BY2 

N. tabacum cv SC58 

D. stramonium 

L. esculentum 

S. cerevisiae KggS|gNEAMIYT| 

H. sapiens TGSDDED|ss|qtfMY 

401 

N. tabacum cv Xanthi 
N. tabacum cv BY2 
N. tabacum cv SC58 
D. stramonium 
L . ' esculentum C 
S. cerevisiae 
H. sapiens 

451 

N. tabacum cv Xanthi 
N. tabacum cv BY2 
N. tabacum cv SC58 
D. stramonium Hjl! 
L. esculentum Hi! 
S. cerevisiae HFYl 
H. sapiens §Ml| 

501 

N. tabacum cv Xanthi 
N. tabacum cv BY2 
KT . tabacum cv SC58 

stramonium 

i. esculentum 

. cerevisiae 

. sapiens 




JDIVYIDSELD- 
-QRPTIYYVMSGPAWELMQQFQNPDFPPE 

533 
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ODC 



26S rRNA 



Et Br 




B 



Low Alkaloid Wild-type 
0 4 12 24 . 0 4 12 24 hr 



ODC 

TBG-1 

PMT 

28S rRNA 
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SEQUENCE LISTING 

<110> Timko P, Michael 

<120> Regulation of Gene Expression in Tabacco for 

Manipulation of Plant Growth and Secondary Metabolism 

<130> 4981*239 

<140> 
<141> 

<160> 26 

<170> Patentln Ver. 2.0 

<210> 1 

<211> 1120 

<212> DNA 

<213> Plant 

<400> 1 

ctgagttgac aagaacaatt cctggtgaat cagatggatg aagataatag aggtgggtgg 60 
aatctataac caaagcagct ggttgagtga ctgtgcgagt tgcagaaaca attgaagggt 12 0 
catttgtgga atttggggcc atttcaaagg aaaaagaaaa gatgacttag cattaataaa 18 0 
tcaaattaaa ataaggctta gcgttaaaat caaaggaaat ggcaagcctg gctcctggag 240 
caatgcttct gaggacagta gtaaaaacaa tatcagacaa aaagtaaagt tgtattattt 300 
agcttgagga taaagtatgt cattagtttt gtgagagatt tggtgtcctc tacaatgatt 360 
gttgaagtcc ctatttatag ctatacacag gaaacaaaat cctaggatca agcccctctt 420 
aaatgacaat aatggggtta atgatgaata tgtagcggca tgacatgaat gccaaaattc 48 0 
tccgcaacga ctatttattt aatattgagg aatatttttt attaaatact atctggtgac 540 
aagcattcgt ttgcttccgt tgattacgtt gattttggga tctactccat accaaccgaa 600 
gccgttgtcc ttgatcttcg ctttcattta attcatcttc cgtctgcctc cgatttcaca 660 
agtcatgcac ccattcaatt atttaatgga aaccaatttt accctataca aatggtacat 720 
cattcgtcaa atactttact tggatataaa caattttgcc cgaggagtaa acagatgcga 780 
agaaagaaag cagacgatta aagaaatttt taaaaaagga gagagaaatg aacacacaca 840 
tgtactaata aaattagggt actactttac taataattgg acagagacta aattcatatt 900 
ttagttccaa aatgtctcgg gcagtccaac catgcacgtt gtaatgattt tttaactcta 960 
ttatatcgag ttgcgccctc cactcctcgg tgtccaaatt gtatataaat gcatatgtgt 1020 
ctattgggag tgtacatcaa gctttcataa agtacaaatc gtaatacttg ttgaaacata 1080 
atactttctc ttctccaatt tgtttagttt aattttgaaa 1120- 

<210> 2 
<211> 3091 
<212> DNA 
<213> Plant 

<400> 2 
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ctgagttgac aagaacaatt cctggtgaat cagatggatg aagataatag aggtgggtgg 60 
aatctataac caaagcagct ggttgagtga ctgtgcgagt tgcagaaaca attgaagggt 120 
catttgtgga atttggggcc atttcaaagg aaaaagaaaa gatgacttag cattaataaa 180 
tcaaattaaa ataaggctta gcgttaaaat caaaggaaat ggcaagcctg gctcctggag 240 
caatgcttct gaggacagta gtaaaaacaa tatcagacaa aaagtaaagt tgtattattt 300 
agcttgagga taaagtatgt cattagtttt gtgagagatt tggtgtcctc tacaatgatt 360 
gttgaagtcc ctatttatag ctatacacag gaaacaaaat cctaggatca agcccctctt 420 
aaatgacaat aatggggtta atgatgaata tgtagcggca tgacatgaat gccaaaattc 480 
tccgcaacga ctatttattt aatattgagg aatatttttt attaaatact atctggtgac 540 
aagcattcgt ttgcttccgt tgattacgtt gattttggga tctactctat accaaccgaa 600 
gccgttgtcc ttgatcttcg ctttcattta attcatcttc cgtctgcctc cgatttcaca 660 
agtcatgcac ccattcaatt atttaatgga aaccaatttt accctataca aatggtacat 720 
cattcgtcaa atactttact tggatataaa caattttgcc cgaggagtaa acagatgcga 780 
agaaagaaag cagacgatta aagaaatttt taaaaaagga gagagaaatg aacacacaca 840 
tgtactaata aaattagggt actactttac taataattgg acagagacta aattcatatt 900 
ttagttccaa aatgtctcgg gcagtccaac catgcacgtt gtaatgattt tttaactcta 960 
ttatatcgag ttgcgccctc cactcctcgg tgtccaaatt gtatataaat gcatatgtgt 1020 
ctattgggag tgtacatcaa gctttcataa agtacaaatc gtaatacttg ttgaaacata 1080 
atactttctc ttctccaatt tgtttagttt aattttgaaa atggaagtca tatctaccaa 1140 
cacaaatggc tctaccatct tcaagagtgg tgccattccc atgaatggcc accataatgg 1200 
cacttccaaa caccaaaacg gccacaagaa tgggacttcc gaacaacaga acgggacaat 1260 
cagccttgat aatggcaacg agctactggg aaactccaat tgtattaagc ctggttggtt 1320 
ttcagagttt agcgcattat ggccaggtta gtactgagaa agaaactcaa atgcatattt 1380 
aaagttaaaa ttgttaggct aatataagga gttgatattc ttttagtgat taattaaaaa 1440 
ggaaaaagta tcaaataaat tcaaaaaatg gatagtaact tcgcatatta ctctacacat 1500 
taatttgaaa taaatcgaat tttgcaggtg aagcattctc acttaaggtt gagaagttac 1560 
tgttccaggg gaagtctgac taccaagatg tcatgctctt tgaggtaaat aatattttaa 1620 
tacacatgct tccatttaaa ttgatacttt taatttactt ttactttatt gcatgtgtac 1680 
gtacagtcag caacttatgg gaaggttctg actttggatg gagcaattca acacacagag 1740 
aatggtggat ttccatacac tgaaatgatt gttcatcttc cacttggttc catcccaaac 1800 
ccaaaaaagg ttttgatcat cggcggagga attggtttta cattattcga aatgcttcgt 1860 
tatcctacaa tcgaaaaaat tgacattgtt gagatcgatg acgtggtagt tgatgtaagt 1920 
caaacttctt ttactcacat aaaaaaatgg tttagattgc ttcttgttat ttttctaaaa 1980 
gaatactatt tttttaaaac aaaattttct tttttacagg tatctagaaa atttttccct 2040 
tatctcgctg ctaattttaa cgatcctcgt gtaaccctag tccttggaga tggtgcgtat 2100 
ttgataatct cgcttttgtt ttatctttta tttttattgc atttaatttt taccttttgg 2160 
tgtgtggtta attcacctgc cattggttct ctttcatttc aggggctgca tttgtaaagg 2220 
ctgcacaagc agaatattat gatgctatta tagtggactc ttctgatccc attggtactc 2280 
tattacttct taataccaag actaatctta ttgaataagc tactaataaa cggtaattga 2340 
tttctaaaac aatataattt caggtccagc aaaagatttg tttgagaggc cattctttga 2400 
ggcagtagct aaagccctaa ggccaggagg agttgtatgc acacaggctg aaagcatttg 2460 
gcttcatatg catattatta agcaaatcat tgctaactgt cgtcaagtct ttaagggctc 2520 
tgtcaactat gcttggacta ctgttccaac atatccaacg tatttttctc tctctctctc 2580 
ttcctataaa attggaagtt ttgattctat aattgtcaag aaatggagaa tcagttccaa 2640 
gaaaaaccaa cttcttttct tttaetcttc aaggtattgt gtttaatttt ttttcaactg 2700 
atatgatcaa ttattttgat ttcagcggtg tgattggtta tatgctctgc tctactgaag 2760 
gaccagaaat tgacttcaag aatccagtaa atccaattga caaagagaca gctcaagtca 2820 
agtccaaatt agcacctctc aagttctaca actctgatgt aacttcatat ctcacaattt 2880 
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cttttttcct attgtacttt atgttcttcg 

gtcttttttt ttttcagatt cacaaagcag 

tgatcgagtc ttaatcaact gattaatgaa 

aataagtgaa agacgtattg tatgagaatt 
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tcaaatttta taattaactc ttttcaaatt 2940 

cattcatttt gccatctttc gccagaagta 3000 

tactggtggt acaatcattg gaccaagatc 3060 

c 3091 



<210> 3 
<211> 353 
<212> PRT 
<213> Plant 

<400> 3 

Met Glu Val lie Ser Thr Asn Thr Asn Gly Ser Thr lie Phe Lys Ser 
1 5 10 15 

Gly Ala lie Pro Met Asn Gly His His Asn Gly Thr Ser Lys His Gin 
20 25 30 

Asn Gly His Lys Asn Gly Thr Ser Glu Gin Gin Asn Gly Thr lie Ser 
35 40 45 

Leu Asp Asn Gly Asn Glu Leu Leu Gly Asn Ser Asn Cys lie Lys Pro 
50 55 60 



Gly Trp Phe Ser Glu Phe Ser Ala Leu Trp Pro Gly Glu Ala Phe Ser 
65 70 75 80 

Leu Lys Val Glu Lys Leu Leu Phe Gin Gly Lys Ser Asp Tyr Gin Asp 
85 90 95 

Val Met Leu Phe Glu Ser Ala Thr Tyr Gly Lys Val Leu Thr Leu Asp 
100 105 110 

Gly Ala lie Gin His Thr Glu Asn Gly Gly Phe Pro Tyr Thr Glu Met 
115 120 125 

lie Val His Leu Pro Leu Gly Ser lie Pro Asn Pro Lys Lys Val Leu 
130 135 140 

lie lie Gly Gly Gly lie Gly Phe Thr Leu Phe Glu Met Leu Arg Tyr 
145 150 155 160 

Pro Thr He Glu Lys He Asp He Val Glu He Asp Asp Val Val Val 
165 170 175 

Asp Val Ser Arg Lys Phe Phe Pro Tyr Leu Ala Ala Asn Phe Asn Asp 
180 185 190 

Pro Arg Val Thr Leu Val Leu Gly Asp Gly Ala Ala Phe Val Lys Ala 
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195 200 205 

Ala Gin Ala Glu Tyr Tyr Asp Ala He He Val Asp Ser Ser Asp Pro 
210 215 220 

He Gly Pro Ala Lys Asp Leu Phe Glu Arg Pro Phe Phe Glu Ala Val 
225 230 235 240 

Ala Lys Ala Leu Arg Pro Gly Gly Val Val cys Thr Gin Ala Glu Ser 
245 250 255 

He Trp Leu His Met His He He Lys Gin He He Ala Asn Cys Arg 
260 265 270 

Gin Val Phe Lys Gly Ser Val Asn Tyr Ala Trp Thr Thr Val Pro Thr 
275 280 285 

Tyr Pro Thr Gly Val He Gly Tyr Met Leu Cys Ser Thr Glu Gly Pro 
290 295 300 

Glu He Asp Phe Lys Asn Pro Val Asn Pro He Asp Lys Glu Thr Ala 
305 310 315 320 

Gin Val Lys Ser Lys Leu Ala Pro Leu Lys Phe Tyr Asn Ser Asp He 
325 330 335 

His Lys Ala Ala Phe lie Leu Pro Ser Phe Ala Arg Ser Met He Glu 
340 345 350 

Ser 



<210> 4 
<211> 711 
<212> DNA 
<213> Plant 

<400> 4 

gaattcaatg gagaaggaaa 
aataatctct atcattcaag 
aagtaatagg tgatcaacag 
tatgctgaag aacccacata 
gagctttatt tttaaagagt 
aaattttact attatttgtt 
agaggaagaa aggagacaga 
tcaaagaaaa tttttaaaat 
actaataatt gcaccgagac 



atatttccag tgtaaacaca 
ccttaggtgg agattaaaaa 
ctttcgtaaa acgtcattag 
aggaagatca taaaatacat 
ggctagctgg tcagcaaaga 
ttaagaggga gatggggcac 
agaggaaata gatttggggg 
ggagagagaa atgagcacac 
aaacttatat tttagttcca 



agtgaatgaa gagaagccaa 60 
aattatttac tttcttatca 120 
gagaatatta taatctcttt 180 
gactttcaga tgacttcttg 240 
ggtgctcgtc agatatcata 300 
acatgcttgt gacaaaagta 360 
gggggggggg ggtttcacaa 420 
acatatacta acaaaatttt 480 
aaatgtcagt ctaaccctgc 540 
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acgttgtaat gaatttttaa ctattatatt atatcgagtt gcgccctcca ctcctcggtg 600 

tccaaattgt atttaaatgc atagatgttt attgggagtg tacagcaagc tttcggaaaa 660 

tacaaaccat aatactttct cttcttcaat ttgtttagtt taattttgaa a 711 

<210> 5 
<211> 3129 
<212> DNA 
<213> Plant 

<400> 5 

gaattcaatg gagaaggaaa atatttccag tgtaaacaca agtgaatgaa gagaagccaa 60 

aataatctct atcattcaag ccttaggtgg agattaaaaa aattatttac tttcttatca 120 

aagtaatagg tgatcaacag ctttcgtaaa acgtcattag gagaatatta taatctcttt 180 

tatgctgaag aacccacata aggaagatca taaaatacat gactttcaga tgacttcttg 240 

gagctttatt tttaaagagt ggctagctgg tcagcaaaga ggtgctcgtc agatatcata 300 

aaattttact attatttgtt ttaagaggga gatggggcac acatgcttgt gacaaaagta 360 

agaggaagaa aggagacaga agaggaaata gatttggggg gggggggggg ggtttcacaa 420 

tcaaagaaaa tttttaaaat ggagagagaa atgagcacac acatatacta acaaaatttt 480 

actaataatt gcaccgagac aaacttatat tttagttcca aaatgtcagt ctaaccctgc 540 

acgttgtaat gaatttttaa ctattatatt atatcgagtt gcgccctcca ctcctcggtg 600 

tccaaattgt atttaaatgc atagatgttt attgggagtg tacagcaagc tttcggaaaa 660 

tacaaaccat aatactttct cttcttcaat ttgtttagtt taattttgaa aatggaagtc 720 

atatctacca acacaaatgg ctctaccatc ttcaagaatg gtgccattcc catgaacggc 780 

caccaaaatg gcacttctga acacctcaac ggctaccaga atggcacttc caaacaccaa 840 

aacgggcacc agaatggcac tttcgaacat cggaacggcc accagaatgg gacatccgaa 900 

caacagaacg ggacaatcag ccatgacaat ggcaacgagc tactgggaag ctccgactct 960 

attaagcctg gctggttttc agagtttagc gcattatggc caggttagta ctaagaaagc 1020 

aactcaaatg catcggcctc ttgttgctac taaatataga gagctatcat acttttaggg 1080 

actaactaaa aaggaaagat tatcacaggg acgaagtgag cagttaactt cgcatattat 1140 

cagacgcatt aatttgaaat aatcgaattt tgcaggtgaa gcattctcac ttaaggttga 1200 

gaagttacta ttccagggga agtctgatta ccaagatgtc atgctctttg aggtaattaa 1260 

tattctaata cacatgcttt aatttaaagt gatactttta atttactttt agtttattgc 1320 

atgtgcacgt acagtcagca acttatggga aggttctgac tttggatgga gcaattcaac 1380 

atacagagaa tggtggattt ccatacactg aaatgattgt tcatctacca cttggttcca 1440 

tcccaaaccc aaaaaaggtt ttgatcatcg gcggaggaat tggttttaca ttattcgaaa 1500 

tgcttcgtta tccttcaatc gaaaaaattg acattgttga gatcgatgac gtggtagttg 1560 

atgtaagtca aacttctttt acccacataa agaaaatgat ttagattgca attcttttta 1620 

tttttctaaa agaataaata tattctcttt ttttttttta aaacaaaatt ctctttctta 1680 

caggtatcca gaaaattttt cccttatctg gcagctaatt ttaacgatcc tcgtgtaacc 1740 

ctagttctcg gagatggtgc gtatatgata gtctcgtttt atattttatt tcacttgatt 1800 

tttacctttt tttgtggtta attaatcatc taccattggt tctctttacc ttcaggagct 1860 

gcatttgtaa aggctgcaca agcgggatat tatgatgcta ttatagtgga ctcttctgat 1920 

cccattggta cgctattact atttaatacc aagactattc ttattaaata agctactaag 1980 

aaactaattg aataattaat aaacgtaact gtaattgatt tctaaaataa tatatataat 2040 

ttcaggtcca gcaaaagatt tgtttgagag gccattcttt gaggcagtag ccaaagccct 2100 

taggccagga ggagttgtat gcacacaggc tgaaagcatt tggcttcata tgcatattat 2160 

taagcaaatc attgctaact gtcgtcaagt ctttaagggt tctgtcaact atgcttggac 2220 

aaccgttcca acatatccca cgtattcttt ttctctctct ctcttcctgt ctttttcgat 2280 
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gcaatgtaaa tttataaaat tggaagtccg ttttactttt ctatagacgt agatcctaaa 2340 

attgtcaaga aatggagaat tgacttacaa gaaaaatcaa cttcttttca tttactattc 2400 

tttttggtga caaactttac ttattatttc gttctaaaat gaaaatttat ttttatattt 2460 

taaaataatt tagctttaaa cttttaattt tacttgttat atttttaata aaaaagattt 2520 

atagtcaaat aaatgttgtg accatataaa aacctccgca tttttaagat cataagtttc 2580 

agagtcaaac gagttaattt atttttagta tgccggtgcg gagtcaaatt atgtcataaa 2640 

aattgaaacg gagtgagaac atttttattt cgagtaaact ttcaaggtat tgtgtttaat 2700 

ttcaagtgat actgatcaat gatgtcttaa atattttgat ttcagcggtg tgatcggtta 2760 

tatgctctgc tctactgaag ggccagaagt tgacttcaag aatccagtaa atccaattga 2820 

caaagagaca actcaagtca agtccaaatt aggacctctc aagttctaca actctgatgt 2880 

aacttcatat ctcacaattt ctttttccgt tttactgtat gttcttcgtc aaattttata 2940 

actaactctt ttcatattgt cttttttttc agattcacaa agcagcattc attttaccat 3000 

ctttcgccag aagtatgatc gagtcttaat caagtgaata atgaacactg gtagtacaat 3060 

cattggacca agatcgagtc ttaatcaagt gaataaataa gtgaaatgcg acgtattgta 3120 

. ggagaattc 3129 

<210> 6 

<211> 375 

<212> PRT 

<213> Plant 

<400> 6 

Met Glu Val He Ser Thr Asn Thr Asn Gly Ser Thr He Phe Lys Asn 
1 5 10 15 

Gly Ala He Pro Met Asn Gly His Gin Asn Gly Thr Ser Glu His Leu 
20 25 30 

Asn Gly Tyr Gin Asn Gly Thr Ser Lys His Gin Asn Gly His Gin Asn 
35 40 45 

Gly Thr Phe Glu His Arg Asn Gly His Gin Asn Gly Thr Ser Glu Gin 
50 55 ' ' 60 

Gin Asn Gly Thr He Ser His Asp Asn Gly Asn Glu Leu Leu Gly Ser 
65 70 75 80 

Ser Asp Ser lie Lys Pro Gly Trp Phe Ser Glu Phe Ser Ala Leu Trp 
85 90 95 

Pro Gly Glu Ala Phe Ser Leu Lys Val Glu Lys Leu Leu Phe Gin Gly 
100 105 110 

Lys Ser Asp Tyr Gin Asp Val Met Leu Phe Glu Ser Ala Thr Tyr Gly 
115 120 125 

Lys Val Leu Thr Leu Asp Gly Ala He Gin His Thr Glu Asn Gly Gly 
130 135 140 
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Phe Pro Tyr Thr Glu Met lie Val His Leu Pro Leu Gly Ser He Pro 
145 150 155 160 

Asn Pro Lys Lys Val Leu He He Gly Gly Gly He Gly Phe Thr Leu 
165 170 175 

Phe Glu Met Leu Arg Tyr Pro Ser He Glu Lys He Asp He Val Glu 
180 185 190 

He Asp Asp Val Val Val Asp Val Ser Arg Lys Phe Phe Pro Tyr Leu 
195 200 205 

Ala Ala Asn Phe Asn Asp Pro Arg Val Thr Leu Val Leu Gly Asp Gly 
210 215 220 

Ala Ala Phe Val Lys Ala Ala Gin Ala Gly Tyr Tyr Asp Ala He He 
225 230 235 240 

Val Asp Ser Ser Asp Pro He Gly Pro Ala Lys Asp Leu Phe Glu Arg 
245 250 255 

Pro Phe Phe Glu Ala Val Ala Lys Ala Leu Arg Pro Gly Gly Val Val 
260 265 270 

Cys Thr Gin Ala Glu Ser He Trp Leu His Met His He He Lys Gin 
275 280 285 

He He Ala Asn Cys Arg Gin Val Phe Lys Gly Ser Val Asn Tyr Ala 
290 295 300 

Trp Thr Thr Val Pro Thr Tyr Pro Thr Gly Val He Gly Tyr Met Leu 
305 310 315 320 

Cys Ser Thr Glu Gly Pro Glu Val Asp Phe Lys Asn Pro Val Asn Pro 
325 330 335 

lie Asp Lys Glu Thr Thr Gin Val Lys Ser Lys Leu Gly Pro Leu Lys 
340 345 350 

Phe Tyr Asn Ser Asp He His Lys Ala Ala Phe He Leu Pro Ser Phe 
355 360 365 

Ala Arg Ser Met He Glu Ser 
370 375 



<210> 7 
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<211> 1134 
<212> DNA 
<213> Plant 

<400> 7 

gctgtacaaa aggatgtctc aaatcatttg gaatattaat tctgcaatca acaagaaata 60 

ccccactatt aagacccatt atcactggca caaaaattat gagatcatta aacatcttaa 120 

acctgtccct atttggaaga gtgtggtatg ggagatgcct cccagggagt acctaaagct 180 

gaatactgat ggaagtttta acaaacaaat tgggaaagca gggattggag ggattctcag 240 

agatgaagag ggaggctttg tcatggcttt ttcgatgcct ataatctata ataacatcag 300 

tgaagcagaa ttgaaagcca tcaagtatgg gtgtgaatgg tgcaaataca aaggaatatc 360 

aaacttcatt gtggaaactg actcgaggat gatctatgac atactacaga ccaaaaatct 420 

aagcaacaac aagttgaaac aagagaccga gaaattaatg gagattctgg acacctgcag 480 

gacacctgtt acccattgcc ttcgcgaagc aaatcaagtg gcagactggt ttgctaaaga 540 

ggccaccaga gctaacgaag gtatcactca tacagatttt agacaggtat caaaagcggc 600 

caagggccct ttcttcatgg atatgtggca ggtcccttat tttagaatta gatatgaaaa 660 

atctaatttt tttttgtaag ttaattctgt gtatagtgag aggaaatcgt ctaatatgta 720 

tttttgccca tagactcttc ctctccttag gtaaaaaggt agctccgagg taaggtttat 780 

gttcccctca gtgtaacctt tttttgttta tataatagac atggtatggg tccagctaaa 840 

cccccaacac cacaggggat agatacctgg gtgattggtt tattttttaa aaaaaaaaac 900 

tttactaata attgcacgga gacaaaactt atattttagt tccaaaatga cagtccaacc . 960 

atgcacgttg taatgatttt ttaactctat tatatcgagt tccgccctcc actcctcggt 1020 

gtccaaattg tatttaaatg catagatatg tttattggga gtgtacatca agctttcaga 1080 

aaatacaaac cataatactt tctcttctcc aatttgctta gtttaatttg gaaa 1134 

<210> 8 
<211> 3269 
<212> DNA 
<213> Plant 

<400> 8 

gctgtacaaa aggatgtctc aaatcatttg gaatattaat tctgcaatca acaagaaata 60 

ccccactatt aagacccatt atcactggca caaaaattat gagatcatta aacatcttaa 120 

acctgtccct atttggaaga gtgtggtatg ggagatgcct cccagggagt acctaaagct 180 

gaatactgat ggaagtttta acaaacaaat tgggaaagca gggattggag ggattctcag 240 

agatgaagag ggaggctttg tcatggcttt ttcgatgcct ataatctata ataacatcag 300 

tgaagcagaa ttgaaagcca tcaagtatgg gtgtgaatgg tgcaaataca aaggaatatc 360 

aaacttcatt gtggaaactg actcgaggat gatctatgac atactacaga ccaaaaatct 420 

aagcaacaac aagttgaaac aagagaccga gaaattaatg gagattctgg acacctgcag 480 

gacacctgtt acccattgcc ttcgcgaagc aaatcaagtg gcagactggt ttgctaaaga 540 

ggccaccaga gctaacgaag gtatcactca tacagatttt agacaggtat caaaagcggc 600 

caagggccct ttcttcatgg atatgtggca ggtcccttat tttagaatta gatatgaaaa 660 

atctaatttt tttttgtaag ttaattctgt gtatagtgag aggaaatcgt ctaatatgta 720 

tttttgccca tagactcttc ctctccttag gtaaaaaggt agctccgagg taaggtttat 780 

gttcccctca gtgtaacctt tttttgttta tataatagac atggtatggg tccagctaaa 840 

cccccaacac cacaggggat agatacctgg gtgattggtt tattttttaa aaaaaaaaac 900 

tttactaata attgcacgga gacaaaactt atattttagt tccaaaatga cagtccaacc 960 

atgcacgttg taatgatttt ttaactctat tatatcgagt tccgccctcc actcctcggt 1020 
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gtccaaattg tatttaaatg catagatatg tttattggga gtgtacatca agctttcaga 1080 

aaatacaaac cataatactt tctcttctcc aatttgctta gtttaatttg gaaaatggaa 1140 

gtcatatcta ccaacacaaa tggctctact atcttcaaga atggtgccat tcccatgaac 1200 

ggttaccaga atggcacttc caaacaccaa " aacggccacc agaatggcac ttccgaacat 1260 

cggaacggcc accagaatgg gatttccgaa caccaaaacg gccaccagaa tggcacttcc 1320 

gagcatcaga acggccatca gaatgggaca atcagccatg acaacggcaa cgagctacag 1380 

ctactgggaa gctccaactc tattaagcct ggttggtttt cagagtttag cgcattatgg 1440 

ccaggttagt actaagaaag aaactcaaat gcatcgtact cttgtattct gctttgcgta 1500 

taatttagat gatggtgttt gactaagcac tgagtttaaa aataaaaagt ttaaagttaa 1560 

attgttacta tagagagcta tatctttagg aactaactaa aaaggaaaaa ttatcacata 1620 

aaattgggat gaagtaagca gttaacttcg catattattc gacacattaa tttgaaataa 1680 

atcgaatttt gcaggtgaag cattctcact taaggttgag aagttactat tccaggggaa 1740 

gtctgattac caagatgtca tgctctttga ggtaattaat taatactaat agtcaagctc 1800 

atgtatgatt atatttaaag tggtattttt cgtttatttt taatttattg cacgtgtacg 1860 

tacagtcagc aacatatggg aaggttctga ctttggatgg agcaattcaa cacacagaga 1920 

atggtggatt tccatacact gaaatgattg ttcatcttcc acttggttcc atcccaaacc 1980 

ctaaaaaggt tttgatcatc ggcggaggaa ttggttttac attattcgaa atgcttcgtt 2040 

atcctacaat cgaaaaaatt gacattgttg agatcgatga cgtggtagtt gatgtaagtc 2100 

aaacttcttt tactcacata aaaaaatgat ttagattctt atttttctaa aagaattaaa 2160 

acaaaatttt ccgttttaca ggtatctaga aaatttttcc cttatcttgc tgctaatttt 2220 

agcgatcctc gtgtaaccct agtccttgga gatggtgcgt atttgataat ctcgttttta 2280 

ttttatcttt tacttttatt ttatttaatt tttacctttt tgtgtgtggt taattcacct 2340 

gccattggtt ctttttattt caggggctgc atttgtaaag gccgcacaag caggatatta 2400 

tgatgctatt atagtggact cttctgatcc cattggtact ctattactac ttaataccaa 2460 

gactattctt attaaataag ctactaataa acgtaactct gatagttttc taaaataata 2520 

taatttcagg tccagcaaaa gacttgtttg agaggccatt ctttgaggca gtagccaaag 2580 

ccctaaggcc aggaggagtt gtatgcacac aggctgaaag catttggctt catatgcata 2640 

ttattaagca aatcattgct aactgtcgtc aagtctttaa gggctctgtc aactatgctt 2700 

ggactactgt tccaacatat ccaacgtatt tttctctctc tcttcctata aaattggaag 2760 

ttttgattct ataattgtca agaaatggag aatcagttcc aagaaaaacc aaattctttt 2820 

cttttactct tcaaggtgtg tttaagtttt ttaaactgat actgatcaat tattttgatt 2880 

tcagcggtgt gattggttat atgctctgtt ctactgaagg accagaagtt gacttcaaga 2940 

atccagtaaa tccaattgac aaagagacaa ctcaagtcaa gtccaaatta gcacctctca 3000 

agttctacaa ctctgatgta acttcatatc tcaatttctt ttttcttatt gtactttatg 3060 

ttcttagtca aattttataa ttaactcttt tcaaattgtc tttttttttc agattcacaa 3120 

agcagcattc attttgccat ctttcgccag aagtatgatc gagtcttaat caagtgacta 3180 

atgaatactg gcggtacaat cattggacca agatcgagtc ttaatcaagt gaataaataa 3240 

gtgaaatgcg acgtattgta taagaattc 3269 

<210> 9 
<211> 381 
<212> PRT 
<213> Plant 

<400> 9 

Met Glu Val lie Ser Thr Asn Thr Asn Gly Ser Thr lie Phe Lys Asn 
15 10 15 
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Gly Ala lie Pro Met Asn Gly Tyr Gin Asn Gly Thr Ser Lys His Gin 
20. 25 30 

Asn Gly His Gin Asn Gly Thr Ser Glu His Arg Asn Gly His Gin Asn 
35 40 45 

Gly He ser Glu His Gin Asn Gly His Gin Asn Gly Thr Ser Glu His 
50 55 60 

Gin Asn Gly His Gin Asn Gly Thr He Ser His Asp Asn Gly Asn Glu 
65 70 75 80 

Leu Gin Leu Leu Gly Ser Ser Asn Ser lie. Lys Pro Gly Trp Phe Ser 
85 90 95 

Glu Phe Ser Ala Leu Trp Pro Gly Glu Ala Phe Ser Leu Lys Val Glu 
100 105 110 

Lys Leu Leu Phe Gin Gly Lys Ser Asp Tyr Gin Asp Val Met Leu Phe 
115 120 125 

Glu Ser Ala Thr Tyr Gly Lys Val Leu Thr Leu Asp Gly Ala He Gin 
130 135 140 

His Thr Glu Asn Gly Gly Phe Pro Tyr Thr Glu Met He Val His Leu 
145 150 155 160 

Pro Leu Gly Ser He Pro Asn Pro Lys Lys Val Leu He He Gly Gly 
165 170 175 

Gly He Gly Phe Thr Leu Phe Glu Met Leu Arg Tyr Pro Thr He Glu 
180 185 190 

Lys He Asp He Val Glu He Asp Asp Val Val Val Asp Val Ser Arg 
195 200 205 

Lys Phe Phe Pro Tyr Leu Ala Ala Asn Phe Ser Asp Pro Arg Val Thr 
210 215 220 

Leu Val Leu Gly Asp Gly Ala Ala Phe Val Lys Ala Ala Gin Ala Gly 
225 230 235 240 

Tyr Tyr Asp Ala He He Val Asp Ser Ser Asp Pro He Gly Pro Ala 
245 250 255 

Lys Asp Leu Phe Glu Arg Pro Phe Phe Glu Ala Val Ala Lys Ala Leu 
260 265 270 
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Arg Pro Gly Gly Val Val Cys Thr Gin Ala Glu Ser lie Trp Leu His 
275 280 285 

Met His lie He Lys Gin He He Ala Asn Cys Arg Gin Val Phe Lys 
290 295 300 

Gly Ser Val Asn Tyr Ala Trp Thr Thr Val Pro Thr Tyr Pro Thr Gly 
305 310 , 315 320 

Val He Gly Tyr Met Leu Cys Ser Thr Glu Gly Pro Glu Val Asp Phe 
325 330 335 

Lys Asn Pro Val Asn Pro He Asp Lys Glu Thr Thr Gin Val Lys Ser 
340 345 350 

Lys Leu Ala Pro Leu Lys Phe Tyr Asn Ser Asp He His Lys Ala Ala 
355 360 365 

Phe He Leu Pro Ser Phe Ala Arg Ser Met He Glu Ser 
370 375 380 



<210> 10 
<211> 469 
<212> DNA 
<213> Plant 

<400> 10 

gtcgacctct gattccacaa gtcatgcacc cattcaatta tttaatggaa accaatttta 60 

ccctgtacaa atggtacaaa tactttcctt ggataaaaac aattttgcct aaggagtaaa 120 

cagatgcgaa gtaagaaagc agacgactaa agaaaatttt aaaaaaggag. agagaaatga 180 

gcacacacac gtactaataa aattagggta ctactttact aataattgga cagagactaa 240 

attcatattt tagttccaaa atgtctcggg cagtccaacc atgcacgttg taatgagttt 300 

ttaactctat tatctcgagt tgcgccctcc actcctctgt gtccaagttg tatataaatg 360 

catatatgtc tattgggagt gtacagcgag ctttcataaa gtacaaatca taatacttgt 420 

tgaaacataa tactttctct tctccaattt gtttagttta attttgaaa 469 

<210> 11 
<211> 3001 
<212> DNA 
.<213> Plant 

<400> 11 

gtcgacctct gattccacaa gtcatgcacc cattcaatta tttaatggaa accaatttta 60 

ccctgtacaa atggtacaaa tactttcctt ggataaaaac aattttgcct aaggagtaaa 120 

cagatgcgaa gtaagaaagc agacgactaa agaaaatttt aaaaaaggag agagaaatga 180 

gcacacacac gtactaataa aattagggta ctactttact aataattgga cagagactaa 240 . 

attcatattt tagttccaaa atgtctcggg cagtccaacc atgcacgttg taatgagttt 300 
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ttaactctat tatctcgagt tgcgccctcc actcctctgt gtccaagttg tatataaatg 360 

catatatgtc tattgggagt gtacagcgag ctttcataaa gtacaaatca taatacttgt 420 

tgaaacataa tactttctct tctccaattt gtttagttta attttgaaaa tggaagtcat 480 

atctaccaac acaaatggct cgaccatctt caagaatggt gccattccca tgaatggcca 540 

ccagagtggc acttccaaac acctcaacgg ctaccagaac ggcacttcca aacaccaaaa 600 

cggccaccat aatggcactt ccgaacatcg gaacggccac cagaatggga tttccgaaca 660 

ccaaaacggc caccagaatg ggacttccga acatcggaac ggccaccaga atgggatttc 72 0 

cgaacaccaa aacggccacc agaatgggac ttccgaacac caaaacggcc accagaatgg 78 0 

gacttccgaa caacagaacg ggacaatcag ccatgacaat ggcaacgagc tactgggaaa 840 

ctccaactct attaagcttg gttggttttc agagtttagc gcattatggc caggttagta 900 

ctgagaaaga aactcaaatt catatttaaa gttaaaattg ttaggctaat ataagaagtt 960 

gattttcttt tagtgattaa ttaaaaaagg aaagagtatc aaataaattc caaaaaatga 1020 

ccagtaactt cgcatattat tctacacatt aatttgaaat aaatcgaatt ttgcaggtga 1080 

agcattctcc cttaaggttg agaagttact atttcagggg aagtctgact accaagatgt 1140 

catgctcttt gaggtaaata atattctaat acacatgctt taatatgaat aaatactttt 1200 

aatttacttt tagtttattg cacgtgtacg tacagtcagc aacatatggg aaggttttga 1260 

ctttggatgg agcaattcaa cacacagaga atggtggatt tccatacact gaaatgattg 1320 

ttcatcttcc acttggttcc atcccaaacc caaaaaaggt tttgatcatc ggcggaggaa 1380 

ttggttttac attattcgaa atgcttcgtt atcctacaat cgaaaaaatt gacattgttg 1440 

aaatcgatga cgtggtag.tt gatgtaagtc aaatttcttt tactcacata aaaaaatgat 1500 

ttagattgct tctttttatt tttctaaaag aataaatata ttctctctta gttttaaaca 1560 

aaattctctt tcttacaggt atctagaaaa tctttccctt atctcgcagc taattttaat 1620 

gatcctcgtg taaccctcgt tctcggagat ggtgcgtatt tataatctcg tttttgtttt 1680 

atcttttatt tttatttcat ttaatttacc tttttgtgtg tggttaattt acccgt'catt 1740 

ggttctcttt catttcaggg gctgcatttg taaaggctgc acaagcagga tattatgatg 1800 

ctattatagt ggactcttct gatcccattg gtactctatt actacttaat accaagacta 1860 

atcttattga ataagctact aataaactgt aattgatttc taaaataata taatttcagg 1920 

tccagcaaaa gatttgtttg agaggccatt ctttgaggca gtagccaaag ccctaaggcc 1980 

aggaggagtt gtatgcacac aggccgaaag catttggctt catatgcata ttattaagca 2040 

aatcattgct aactgtcgtc aagtctttaa gggctctgtc aactacgctt ggactactgt 2100 

tccaacatat cccacgtatt ttctctctct ctctcttcat ctttgaaaat tgaaaatcct 2160 

gactactttc cttcctttga ttcctcggtt aaaggggcgt agatcataag attttcaaga 2220 

aatagataat gacgtccaag aaaaactaac ttcttttcat ttactattct ttttggtgac 2280 

aaactttatt tat;tatttcg ttctaaagag aaaatttatt tttatatttt aaaataattt 2340 

tgttttaaac ttttattttt acttattata tctttaataa aaaaattata gtcaaataaa 2400 

tattatggcc acactaaaca tccaagtttt tgaaaccata agttttagag ccaaatgagt 2460 

taatttgttt ttggtatgcg ggtgcggagt caaattatgt cacaaaaatt gtaatggagt 2520 

gagcaaattt ttatttcgag taaactttca aggtattgtg ttaaagtttt ttcaactgat 2580 

actaatcaat tatgtctcaa ccattttgat ttcagtggtg taattgggta tatgctctgc 2640 

tctactgaag ggccagaagt tgacttcaag aatccaataa atccaattga caaagagaca 2700 

actcaagtca agtccaaatt agcacctctc aagttttaca attctgatgt aacttcatat 2760 

ctaacaattt ctttttctgt tttactgtat cttcattgtc aaaattttat aattaactct 2820 

tctcaaatta tctttttttt tagattcaca aagcagcatt cattttgcca tctttcgcca 2880 

gaagtatgat cgagtcttaa tcaagtgaat aatgaacact ggtggtgcaa tcattggacc 2940 

aagatcgagt cttaatcaag tgaataaata agtgaaatgc cgacgtattg tatgagaatt 3000 

c .... 3001 

<210> 12 
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<211> 419 
<212> PRT 
<213> Plant 



<400> 12 

Met Glu Val lie Ser Thr Asn Thr Asn Gly Ser Thr He Phe Lys Asn 
1 5 10 15 



Gly Ala He Pro Met Asn Gly His Gin Ser Gly Thr Ser Lys His Leu 
20 25 30 

Asn Gly Tyr Gin Asn Gly Thr Ser Lys His Gin Asn Gly His His Asn 
35 40 45 

Gly Thr Ser Glu His Arg Asn Gly His Gin Asn Gly He Ser Glu His 
50 55 60 

Gin Asn Gly His Gin Asn Gly Thr Ser Glu His Arg Asn Gly His Gin 
65 70 75 80 

Asn Gly He Ser Glu His Gin Asn Gly His Gin Asn Gly Thr Ser Glu 
85 90 95 

His Gin Asn Gly His Gin Asn Gly Thr Ser Glu Gin Gin Asn Gly Thr 
100 105 110 

He Ser His Asp Asn Gly Asn Glu Leu Leu Gly Asn Ser Asn Ser He 
115 120 125 

Lys Leu Gly Trp Phe Ser Glu Phe Ser Ala Leu Trp Pro Gly Glu Ala 
130 135 140 

Phe Ser Leu Lys Val Glu Lys Leu Leu Phe Gin Gly Lys Ser Asp Tyr 
145 150 155 160 

Gin Asp Val Met Leu Phe Glu Ser Ala Thr Tyr Gly Lys Val Leu Thr 
165 ' 170 175 



Leu Asp Gly Ala He Gin His Thr 
180 

Glu Met He Val His Leu Pro Leu 

195 . 200 

Val Leu He He Gly Gly Gly He 

210 215 

Arg Tyr Pro Thr lie Glu Lys lie 



Glu Asn Gly Gly Phe Pro Tyr Thr 
185 190 

Gly Ser He Pro Asn Pro Lys Lys 
205 

Gly Phe Thr Leu Phe Glu Met Leu 
220 

Asp He Val Glu He Asp Asp Val 
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225 230 235 240 

Val Val Asp Val Ser Arg Lys Ser Phe Pro Tyr Leu Ala Ala Asn Phe 
245 250 255 

Asn Asp Pro Arg Val Thr Leu Val Leu Gly Asp Gly Ala Ala Phe Val 
260 265 270 

Lys Ala Ala Gin Ala Gly Tyr Tyr Asp Ala lie lie Val Asp Ser Ser 
275 280 285 

Asp Pro lie Gly Pro Ala Lys Asp Leu Phe Glu Arg Pro Phe Phe Glu 
290 295 300 

Ala Val Ala Lys Ala Leu Arg Pro Gly Gly Val Val Cys Thr Gin Ala 
305 310 315 320 

Glu Ser lie Trp Leu His Met His He He Lys Gin He He Ala Asn 
325 330 335 

Cys Arg Gin Val Phe Lys Gly Ser Val Asn Tyr Ala Trp Thr Thr Val 
340 345 350 

Pro Thr Tyr Pro Thr Gly Val He Gly Tyr Met Leu Cys Ser Thr Glu 
355 360 365 

Gly Pro Glu Val Asp Phe Lys Asn Pro He Asn Pro He Asp Lys Glu 
370 375 380 

Thr Thr Gin Val Lys Ser Lys Leu Ala Pro Leu Lys Phe Tyr Asn Ser 
385 390 395 400 

Asp He His Lys Ala Ala Phe He Leu Pro Ser Phe Ala Arg Ser Met 
405 410 ■ 415 

He' Glu Ser 



<210> 13 
<211> 1636 
<212> DNA 
<213> Plant 

<400> 13 

ggcacgagat cagatccaat tctcttctgt gcttcccttc tctgctctca aattcttcag 60 

atctacaaag ttttcttcat tttcagaggg cagacatgga aactttcttg ttcacctcag 120 

agtcagtcaa tgaaggccac cccgacaagc tctgcgacca ggtctcggat gcaattcttg 180 
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atgcttgctt agaacaggat ccagaaagca aggttgcatg tgaaacctgc acaaagacaa 240 

acatggttat ggtctttgga gagatcacaa ccaaggccac tgttgactat gagaagatag 300 

tgcgtgacac atgcagaggc attgggttca cctcagcaga tgttggcctt gacgctgaca 360 

actgcaaggt tcttgtcaac atcgagcagc agagccctga cattgcccaa ggtgttcacg 420 

gtcatcttac caagaaacca gaagagattg gagctggtga ccaaggtcac atgtttggct 480 

atgccactga tgaaacccca gagctcatgc cccttaccca tgtttgggcc actaagcttg 540 

gtgccaagct taccgaagtg aggaagaaca agacttgccc atggctcaga ccagatggca 600 

agacccaagt tactgttgag tacaagaacg acaatggtgc catggtccca attagagttc 660 

acactgttct catctcaact caacatgacg aaactgtcac aaacgaccag attgcccagg 720 

acttgaaaga gcatgtgatc aaacctgtga tcccatctca gtaccttgat gagaatacca 780 

tcttccacct caacccatca ggtcgcttcg tcatcggtgg accacacgga gatgctggac 840 

ttaccggcag gaaaattatc attgacacct acggaggctg gggtgcccat ggaggaggtg 900 

ctttctcagg aaaggaccct actaaggtgg acaggagtgg tgcttatatt gttagacagg 960 

cagcaaagag tgtggtcgcc tcaggacttg ctcgccgctg tattgtgcag gtttcttatg 1020 

ctatcggtgt ggctgaacca ctttccgtgt ttgttgacac ttacaagact ggaacaattc 1080 

cagacaagga tattttgact ctgatcaagg agaactttga cttcaggcct ggaatgatgt 1140 

caatcaacct tgacttgtta agaggaggca acttcaggta ccagaagact gcagcttatg 1200 

gtcactttgg ccgtgatgac cccgacttct catgggagac tgtcaaggtc ctcaagccaa 1260 

aagcttaagt gaggtgtagc cttttggcca ttatttttct tgcagaccaa taaacaagct 1320 

tcatcatatc atgcattggt ggcaggagaa gagaatttgt gtctccattg gaggattcta 1380 

tgagctctga gtcattgaac attgttattt ttctttcttt ttttttcacc cttttctgca 1440 

gtaccttatt tttattttgt tactgttaag tagcagtgat ttaagttttc cctgttaagt 1500 

agcagtgatt taagttttcc ctgttaagta gctggaatta agtttccatg ttctatcata 1560 

ttatatgtga acttgtcaat tatctcctga ggtgaaagag tccttcaggg aatagtttaa 1620 
aaaaaaaaaa aaaaaa 1636 



<210> 14 
<211> 390 
<212> PRT 
<213> Plant 

<400> 14 

Met Glu Thr Phe Leu Phe Thr Ser Glu Ser Val Asn Glu Gly His Pro 
15 10 15 

Asp Lys Leu Cys Asp Gin Val Ser Asp Ala lie Leu Asp Ala Cys Leu 
20 25 30 

Glu Gin Asp Pro Glu Ser Lys Val Ala Cys Glu Thr Cys Thr Lys Thr 
35 40 45 

Asn Met Val Met Val Phe Gly Glu lie Thr Thr Lys Ala Thr Val Asp 
50 55 60 

Tyr Glu Lys He Val Afg Asp Thr Cys Arg Gly He Gly Phe Thr Ser 
65 70 75 80 

Ala Asp Val Gly Leu Asp Ala Asp Asn Cys Lys Val Leu Val Asn lie 
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85 90 95 

Glu Gin Gin Ser Pro Asp He Ala Gin Gly Val His Gly His Leu Thr 
100 105 HO 

Lys Lys Pro Glu Glu He Gly Ala Gly Asp Gin Gly His Met Phe Gly 
115 120 125 

Tyr Ala Thr Asp Glu Thr Pro Glu Leu Met Pro Leu Thr His Val Trp 
130 135 140 

Ala Thr Lys Leu Gly Ala Lys Leu Thr Glu Val Arg Lys Asn Lys Thr 
145 150 155 160 

Cys Pro Trp Leu Arg Pro Asp Gly Lys Thr Gin Val Thr Val Glu Tyr 
165 170 175 

Lys Asn Asp Asn Gly Ala Met Val Pro He Arg Val His Thr Val Leu 
180 185 190 

He Ser Thr Gin His Asp Glu Thr Val Thr Asn Asp Gin He Ala Gin 
195 200 205 

Asp Leu Lys Glu His Val He Lys Pro Val He Pro Ser Gin Tyr Leu 
210 215 220 

Asp Glu Asn Thr He Phe His Leu Asn Pro Ser Gly Arg Phe Val He 
225 230 235 240 

Gly Gly Pro His Gly Asp Ala Gly Leu Thr Gly Arg Lys He He He 
245 250 255 

Asp Thr Tyr Gly Gly Trp Gly Ala His Gly Gly Gly Ala Phe Ser Gly 
.260 265 270 

Lys Asp Pro Thr Lys Val Asp Arg Ser Gly Ala Tyr He Val Arg Gin 
275 280 285 

Ala Ala Lys Ser Val Val Ala Ser Gly Leu Ala Arg Arg Cys He Val 
290 295 300 

Gin Val Ser Tyr Ala He Gly Val Ala Glu Pro Leu Ser Val Phe Val 
305 310 315 320 

Asp Thr Tyr Lys Thr Gly Thr He Pro Asp Lys Asp He Leu Thr Leu 
325 330 335 

lie Lys Glu Asn Phe Asp Phe Arg Pro Gly Met Met Ser He Asn Leu 
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345 
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350 



Asp Leu Leu Arg Gly Gly Asn Phe Arg Tyr Gin Lys Thr Ala Ala Tyr 
355 360 365 



Gly His Phe Gly Arg Asp Asp Pro Asp Phe Ser Trp Glu Thr Val Lys 
370 375 380 



Val Leu Lys Pro Lys Ala 
385 390 



<210> 15 
<211> 1596 
<212> DNA 
<213> Plant 

<400> 15 

ggcacgaggg gaacaagaga aacatcatat 
ttgattcctt cctctcattt acctctctct 
caatcatcgt ttccgggttg aacccggcgg 
cttctcctac agcggcggcg gcggcggaaa 
gagatgcctt acaagatttc atgttatcaa 
aaccttttta cgtgctagac ttgggtgagg 
ctctcccaaa tatccgtcca ttttacgctg 
caattttatc tgctatgggc tcaaattttg 
ttttatctct tggcatttca cctgaccgta 
ccgatattat ttttgcagca aaagttgggg 
aggtttacaa gatccgaaag catcacccga 
tgctcgacgg caacgcgaga tgcccaatgg 
tcgacccgct gctccgggca gctcaagccg 
acatcggtag cggagatgcc gattcaaacg 
aagtgtttga aacagctgct aaactcggga 
gcgggtttac atccggccac cagttcacaa 
aacaacactt cgatgacgaa ccggagttga 
cagagacggc gtttactttg gcaacgacga 
gggagtattg gattaacgac gggctgtacg 
cgacggtgaa tgcaacgccg ttagctgttc 
ggtcgaaaac gtttccgacg actgtgtttg 
taagggatta ccagttaccg gagctgcagg 
gtgcttatac taaagctgct gggtccaatt 
ctcacctcgc ttattcttat ccaagctgat 
gttttgatgg ttttttcctt ttttgggtat 
aatttatatt ccaaatcagc ttgtaattct 
taattgtgat tttctctaaa aaaaaaaaaa 

<210> 16 
- <211> 433 



tattgaatcc ctagtttctt ttctttccct 60 
tttcttcctt tgtttggatg gccggccaaa 120 
ccattcttca gtccacaatt ggcggcggag 180 
acggcaccag aaaagtcatc cctctctcaa 240 
tcataaccca aaaattacaa gatgagaaac 300 
ttgtttctct tatggaccaa tggaaatctg 360 
ttaaatgtaa ccctgaaccg tcgttccttt 420 
attgtgctag ccgagctgaa attgagtatg 480 
ttgttttcgc aaatccatgc aaaccggaat 540 
tgaatcttac aacctatgat tctgaagacg 600 
aatccgaact cttgctccgc atcaagccca 660 
gcccgaaata cggcgcgctt ccagaagaag 720 
cccgtctcac cgtatccggc gtctcattcc 780 
cttatctcgg cgccataacc gcggctaagg 840 
tgtcgaaaat gactgttcta gacgtcggcg 900 
ccgccgccgt cgccgttaaa tcagctttaa 960 
caatcatagc tgaaccgggt cggttttttg 1020 
ttatagggaa aagagtgagg . ggtgaattga 1080 
gttcgatgaa ctgtgtactt tacgaccatg 1140 
tgtcgaatcg tagtaacgtt acctgcggcg 1200 
ggcccacttg tgatgctctt gatactgttt 1260 
ttaatgattg gctggttttt cctaatatgg 1320 
ttaatggatt taatacttcc gccattgtta 1380 
gaaccacctg tattaggaat tactaccgtg 1440 
ctttttttta attttgttgt ttttggtagt 1500 
cttgtatgcc ataagaatgc aaggatttgc 1560 
aaaaaa 1596 
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<212> PRT 
<213> Plant 

<400> 16 

Met Ala Gly Gin Thr He He Val Ser Gly Leu Asn Pro Ala Ala He 
15 10 15 

Leu Gin Ser Thr He Gly Gly Gly Ala Ser Pro Thr Ala Ala Ala Ala 
20 25 30 

Ala Glu Asn Gly Thr Arg Lys Val He Pro Leu Ser Arg Asp Ala Leu 
35 40 45 

Gin Asp Phe Met Leu Ser He He Thr Gin Lys Leu Gin Asp Glu Lys 
50 55 60 

Gin Pro Phe Tyr Val Leu Asp Leu Gly Glu Val Val Ser Leu Met Asp 
65 70 75 80 

Gin Trp Lys Ser Ala Leu Pro Asn He Arg Pro Phe Tyr Ala Val Lys 
85 90 95 

Cys Asn Pro Glu Pro Ser Phe Leu Ser He Leu Ser Ala Met Gly Ser 
100 105 110 

Asn Phe Asp Cys Ala Ser Arg Ala Glu He Glu Tyr Val Leu Ser Leu 
115 120 125 

Gly He Ser Pro Asp Arg He Val Phe Ala Asn Pro Cys Lys Pro Glu 
130 135 140 

Ser Asp He He Phe Ala Ala Lys Val Gly Val Asn Leu Thr Thr Tyr 
145 150 155 , 160 

Asp Ser Glu Asp Glu Val Tyr Lys He Arg Lys His His Pro Lys Ser 
165 170 175 

Glu Leu Leu Leu Arg He Lys Pro Met Leu Asp Gly Asn Ala Arg Cys 
180 185 190 

Pro Met Gly Pro Lys Tyr Gly Ala Leu Pro Glu Glu Val Asp Pro Leu 
195 200 205 

Leu Arg Ala Ala Gin Ala Ala Arg Leu Thr Val Ser Gly Val Ser Phe 
210 215 220 

His He Gly Ser Gly Asp Ala Asp Ser Asn Ala Tyr Leu Gly Ala He 
225 230 235 240 
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Ala Ala Ala Lys Glu Val Phe Glu Thr Ala Ala Lys Leu Gly Met Ser 
245 250 255 

Lys Met Thr Val Leu Asp Val Gly Gly Gly Phe Thr Ser Gly His Gin 
260 265 270 

Phe Thr Thr Ala Ala Val Ala Val Lys Ser Ala Leu Lys Gin His Phe 
275 280 285 

Asp Asp Glu Pro Glu Leu Thr lie lie Ala Glu Pro Gly Arg Phe Phe 
290 295 300 

Ala Glu Thr Ala Phe Thr Leu Ala Thr Thr lie He Gly Lys Arg Val 
305 310 315 320 

Arg Gly Glu Leu Arg Glu Tyr Trp lie. Asn Asp Gly Leu Tyr Gly Ser 
325 330 335 

Met Asn Cys Val Leu Tyr Asp His Ala Thr Val Asn Ala Thr Pro Leu 
340 345 350 

Ala Val Leu Ser Asn Arg Ser Asn Val Thr Cys Gly Gly Ser Lys Thr 
355 360 365 

Phe Pro Thr Thr Val Phe Gly Pro Thr Cys Asp Ala Leu Asp Thr Val 
370 375 380 

Leu Arg Asp Tyr Gin Leu Pro Glu Leu Gin Val Asn Asp Trp Leu Val 
385 390 395 400 

Phe Pro Asn Met Gly Ala Tyr Thr Lys Ala Ala Gly Ser Asn Phe Asn 
405 410 415 

Gly Phe Asn Thr Ser Ala He Val Thr His Leu Ala Tyr Ser Tyr Pro 
420 425 430 

Ser 



<210> 17 
<211> 2074 
<212> DNA 
<213> Plant 

<400> 17 

tggtaactgg accgacgcga catttgtcgt atatgtctta atcgggctag tcgctgacaa 60 
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catcatccac caagtcaaag ttcggaaatt 
aaatgagggg actatctgta tacggtcaaa 
gattagaaca taatggtcta aggttcatca 
gttgtcaagc tcaagcccca gagagcgatc 
agacctagag accgatcaat accgagaccg 
gaccggtcaa gattgagatc ggccaagatc 
atagccgcat ttagggagag aatctctgcg 
attaattaat ctatcatgtg atccccacta 
cccccactat attaagagtg gttatcattt 
gacatataca gaaaatagag caaatactat 
tgtttcttct acccattgtt cttcactcaa 
ctaactagtc cattcgggtt gcattcattt 
attttctcaa ttgtactaag ttataccacg 
tatccatttt tcgggtaaac accgaataca 
ccaaagccaa gggttcattc ctttctgaag 
agttatctaa atctttataa tttagccttc 
agaacaatag cctcgcatag atttaccttc 
tttgcaaatg tgtctaaaat gataggactt 
ctttaatttt caagaaatta attttaagca 
ttgccatctc tgactcggat aggacctcga 
atacattaca taagccacaa aatagtaact 
tcgcgattat ttcataccta acacgtttac 
tagtgatata aaataccgaa tttaccacgc 
agagaatctc ttagcccccc aaatacgaaa 
ttaccatgaa aataaagaaa aagagaaaag 
tccactgccc agccacgcat ttccctccta 
tcgctcactt cacagttgtt agtttcacgt 
tccgttaggt ttccgtccta ttttctcttc 
tgtatggtgt ttttcttgtt cgaattttag 
ataattctct gtttaaacca aaaacttagc 
atcgtgtaag agtgtgttag aaggtgatta 
ttgagggggt agccggggcc tcggcctcgg 
cattgggcaa aaacatcatt aaatctgtac 
tgtattcttt gattctttaa cagaagaaga 



catatcgttt ctcatcatct tctatccgag 120 
accgagtctg cccttcatat gactaatcga 180 
ttataataac gagccatgat atagagttag 240 
aatatcgaga tcgagccaag gttaaactcg 300 
accaagtcaa ggtcgagctc gagacccaga 360 
gagatcgagc caagaaatta aaaagtcgtt 420 
gaaatcacga cttgaatcag ggaaaaacta 480 
tgtattttta attatactca aaatgggatt 540 
gtaatggaga gacatacaca cattcattct 600 
ccttttttgg cttttgatat ttagtcatat 660 
tttggaggtg ataaaacttg aaggtttaag 720 
cttttacaat aatttcgtca tcatttattt 780 
tatttttaga actgcgtata aattcaactc 840 
tagcacaata gcaccctcaa ttgcaaaagt 900 
aaatgagata gagaattgaa aatctaattt 960 
catataagaa aaaggaaaca aattaactga 1020 
tccatataaa ttttgtttat actcaatttt 1080 
gcaaattttt atttaacatt tcctactcct 1140 
ttctcgattt gctctgcccg ctccgtcccg 1200 
ttgcaaaaat ccaaacccaa ggaaccttcc 1260 
attaaaaact accaatatat cctcaaatac 1320 
cttatcttct cgtaatgacg ctacattagt 1380 
ggcaaccctc cgctgtctat ccacggcccg 1440 
attaacttct agaattttat tttctggtta 1500 
tcaagaaatt taattgggct aatactgggg 1560 
tataaagcgt cgtcacctct catgcaaatc 1620 
tctcttctca attcccataa aagaaaccct 1680 
ttctacgctt cctcttctga tatcaatatc 1740 
atttgttttg cctttaatac ctgtaacctt 1800 
ttcttctgaa gtcagggtgg ggatatttgg 1860 
tcttttgatt cagttccttt tttgcttctt 1920 
cgggttttaa tagcccccat ctattacaac 1980 
aaagcaaacc cttaatttag tttaattttc 2040 
agag 2074 



<210> 18 
<211> 4321 
<212> DNA 
<213> Plant 

<400> 18 

tggtaactgg accgacgcga catttgtcgt atatgtctta atcgggctag tcgctgacaa 60 

catcatccac caagtcaaag ttcggaaatt catatcgttt ctcatcatct tctatccgag 120 

aaatgagggg actatctgta tacggtcaaa accgagtctg cccttcatat gactaatcga 180 

gattagaaca taatggtcta aggttcatca ttataataac gagccatgat atagagttag 240 

gttgtcaagc tcaagcccca gagagcgatc aatatcgaga tcgagccaag gttaaactcg 300 

agacctagag accgatcaat accgagaccg accaagtcaa ggtcgagctc gagacccaga 360 

gaccggtcaa gattgagatc ggccaagatc gagatcgagc caagaaatta aaaagtcgtt 420 
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atagccgcat ttagggagag aatctctgcg gaaatcacga cttgaatcag ggaaaaacta 480 

attaattaat ctatcatgtg atccccacta tgtattttta attatactca aaatgggatt 540 

cccccactat attaagagtg gttatcattt gtaatggaga gacatacaca cattcattct 600 

gacatataca gaaaatagag caaatactat ccttttttgg cttttgatat ttagtcatat 660 

tgtttcttct acccattgtt cttcactcaa tttggaggtg ataaaacttg aaggtttaag 720 

ctaactagtc cattcgggtt gcattcattt cttttacaat aatttcgtca tcatttattt 780 

attttctcaa ttgtactaag ttataccacg tatttttaga actgcgtata aattcaactc 840 

tatccatttt tcgggtaaac accgaataca tagcacaata gcaccctcaa ttgcaaaagt 900 

ccaaagccaa gggttcattc ctttctgaag aaatgagata gagaattgaa aatctaattt 960 

agttatctaa atctttataa tttagccttc catataagaa aaaggaaaca aattaactga 1020 

agaacaatag cctcgcatag atttaccttc tccatataaa ttttgtttat actcaatttt 1080 

tttgcaaatg tgtctaaaat gataggactt gcaaattttt atttaacatt tcctactcct 1140 

ctttaatttt caagaaatta attttaagca ttctcgattt gctctgcccg ctccgtcccg 1200 

ttgccatctc tgactcggat aggacctcga ttgcaaaaat ccaaacccaa ggaaccttcc 1260 

atacattaca taagccacaa aatagtaact attaaaaact accaatatat cctcaaatac 1320 

tcgcgattat ttcataccta acacgtttac cttatcttct cgtaatgacg ctacattagt 1380 

tagtgatata aaataccgaa tttaccacgc ggcaaccctc cgctgtctat ccacggcccg 1440 

agagaatctc ttagcccccc aaatacgaaa attaacttct agaattttat tttctggtta 1500 

ttaccatgaa aataaagaaa aagagaaaag tcaagaaatt taattgggct aatactgggg 1560 

tccactgccc agccacgcat ttccctccta tataaagcgt cgtcacctct catgcaaatc 1620 

tcgctcactt cacagttgtt agtttcacgt tctcttctca attcccataa aagaaaccct 1680 

tccgttaggt ttccgtccta ttttctcttc ttctacgctt cctcttctga tatcaatatc 1740 

tgtatggtgt ttttcttgtt cgaattttag atttgttttg cctttaatac ctgtaacctt 1800 

ataattctct gtttaaacca aaaacttagc ttcttctgaa gtcagggtgg ggatatttgg 1860 

atcgtgtaag agtgtgttag aaggtgatta tcttttgatt cagttccttt tttgcttctt 192 0 

ttgagggggt agccggggcc tcggcctcgg cgggttttaa tagcccccat ctattacaac 1980 

cattgggcaa aaacatcatt aaatctgtac aaagcaaacc cttaatttag tttaattttc 2040 

tgtattcttt gattctttaa cagaagaaga agagatgccg gccctaggtt gttgcgtaga 2100 

cgctactgtt tcccctcctc tcggctatgc cttctctcgg gatagctctc ttcccgcgcc 2160 

ggagttcttt acctccggcg tacctcctac aaactccgcc gccggttcca ttgggtctcc 2220 

ggatctgtcc tctgctttgt acggggtcga tgggtgggga gctccttatt tctccgttaa 2280 

ctctaacgga gatatctccg tccgaccaca tggtacggac acactccccc accaggaaat 2340 

tgaccttctc aaggtcgtga aaaaggcctc cgacccgaaa aattcagggg ggctcgggct 2400 

tcagctgcct cttgttgttc gcttccctga tgtgctaaaa aaccggttgg aatctctgca 2460 

atcggctttt gatctcgctg ttcattccca gggctatggg gcccactacc aaggtgttta 2520 

tcccgtgaaa tgcaatcaag acaggttcgt ggtggaagat attgtcaaat tcgggtcgtc 2580 

attccggttc gggttggaag ctgggtctaa acccgagctc ctgttagcca tgagctgtct 2640 

ctgcaggggc agtgctgagg gccttctcgt ttgcaatggt ttcaaggacg ctgagtacat 2700 

ttcgcttgct ttggttgcaa gaaagctcat gttaaacact gtaattgttc ttgaacaaga 2760 

ggaggagctt gaccttgtga ttgatataag ccgtaagatg gctgttcggc ccgtaattgg 2820 

acttcgggct aagctcagga ccaagcattc aggccatttt ggatccactt ctggagaaaa 2880 

aggtaagttt gggcttacaa cgacccaaat tgttcgtgta gtgaagaagc tggaagaatc 2940 

cggaatgctg gattgccttc agttgctgca ttttcacatt ggatctcaga tcccttcaac 3000 

ggcgttgctt gctgatggtg ttggtgaggc tgctcagatt tattgtgaat taatccgtct 3060 

tggtgcgggt atgaagttca ttgatactgg aggtgggctc ggaattgatt atgatggtac 3120 

taaatcatgt gattcagatg tctctgttgg ctatggcatt caagaatacg cctccacagt 3180 

tgtccaggcg gttcaatatg tttgcgaccg taagggcgtg aagcacccag tgatttgcag 3240 

cgaaagtggc agggcaattg tttctcatca ctcaattctg attttcgaag ccgtgtctgc 3300 
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ttctagtcac tcatgttctt cttcacatct gtcttctggt ggcctccaat ccatggcgga 33 60 

gacgctcaat gaagatgccc ttgctgatta ccgcaattta tctgctgctg cagttcgtgg 3420 

agagtacgag acgtgtgtac tttactctga tcagttgaaa cagagatgtg tggatcagtt 3480 

taaagaaggg tccttgggta ttgaacatct tgctgctgtt gatagcatct gtgattttgt 3540 

atcaaaggct atgggggctg ctgatcctat ccgcacttac catgtgaatc tgtcaatttt 3600 

cacttcaatt cctgattttt gggcctttgg tcaattgttt ccgattgttc caatacaccg 3660 

tttagatgaa aagcctgcag taaggggaat attatcggac ttgacttgtg acagtgatgg 3720 

gaaggttgat aagttcattg gtggcgaatc aagcttgcag ctgcatgaat tgggaagtaa 3780 

tggcgatggt ggtgggtatt atctggggat gtttttgggt -ggggcttatg aggaggcgct 38 40 

cggaggactc cacaacctgt ttggtggacc aagcgtggtg cgcgtggtgc agagcgatag 3900 

cgctcacagc ttcgccatgt ctcgctccgt ccctggcccg tcctgcgcgg acgtgctccg 3960 

agcgatgcag cacgagcccg agctcatgtt cgagactctc aagcaccgtg cggaggaatt 4020 

cttggaacaa gaagaagaca aagggctggc cattgcatct ttggccagca gcttagctca 4080 

gtccttccat aacatgcctt accttgtggc gcctgcatct tgctgcttca ctgcagttac 4140 

tgctaacaac ggtggctata actactatta cagtgatgag aatgcagcag attctgctac 4200 

aggggaggat gagatttggt cctattgcac tgcttgaagt gttgtcgtag catctccagt 4260 

tttagtttgt cgtcgaagtt gtctgttttt gaataatacc cttagttggt gatgtttttc 4320 
t 4321 

<210> 19 
<211> 720 
<212> PRT 
<213> Plant- 

<400> 19 

Met Pro Ala Leu Gly Cys Cys Val Asp Ala Thr Val Ser Pro Pro Leu 
1.5 10 15 

Gly Tyr Ala Phe Ser Arg Asp Ser Ser Leu Pro Ala Pro Glu Phe Phe 
20 25 30 

Thr Ser Gly Val Pro Pro Thr Asn Ser Ala Ala Gly Ser lie Gly Ser 
35 40 45 

Pro Asp Leu Ser Ser Ala Leu Tyr Gly Val Asp Gly Trp Gly Ala Pro 

50 55 60 

Tyr Phe Ser Val Asn Ser Asn Gly Asp lie Ser Val Arg Pro His Gly 
65 70 75 80 

Thr Asp Thr Leu Pro His Gin Glu lie Asp Leu Leu Lys Val Val Lys 
85 90 95 

Lys Ala Ser Asp Pro Lys Asn Ser Gly Gly Leu Gly Leu Gin Leu Pro 
100 105 110 

Leu Val Val Arg Phe Pro Asp Val Leu Lys Asn Arg Leu Glu Ser Leu 
115 120 125 
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Gin Ser Ala Phe Asp Leu Ala Val His Ser Gin Gly Tyr Gly Ala His 
130 135 140 

Tyr Gin Gly Val Tyr Pro Val Lys Cys Asn Gin Asp Arg Phe Val Val 
145 150 155 160 

Glu Asp lie Val Lys Phe Gly Ser Ser Phe Arg Phe Gly Leu Glu Ala 
165 170 175 

Gly Ser Lys Pro Glu Leu Leu Leu Ala Met Ser Cys Leu cys Arg Gly 
180 185 190 

Ser Ala Glu Gly Leu Leu Val Cys Asn Gly Phe Lys Asp Ala Glu Tyr 
195 200 205 

lie Ser Leu Ala Leu Val Ala Arg Lys Leu Met Leu Asn Thr Val lie 
210 215 220 

Val Leu Glu Gin Glu Glu Glu Leu Asp Leu Val lie Asp lie Ser Arg 
225 230 235 240 

Lys Met Ala Val Arg Pro Val lie Gly Leu Arg Ala Lys Leu Arg Thr 
245 250 255 

Lys His Ser Gly His Phe Gly Ser Thr Ser Gly Glu Lys Gly Lys Phe 
260 265 270 

Gly Leu Thr Thr Thr Gin lie Val Arg Val Val Lys Lys Leu Glu Glu 
275 280 285 

Ser Gly Met Leu Asp Cys Leu Gin Leu Leu His Phe His He Gly Ser 
290 295 300 

Gin He Pro Ser Thr Ala Leu Leu Ala Asp Gly Val Gly Glu Ala Ala 
305 310 315 320 

Gin He Tyr Cys Glu Leu He Arg Leu Gly Ala Gly Met Lys Phe He 
325 330 335 

Asp Thr Gly Gly Gly Leu Gly He Asp Tyr Asp Gly Thr Lys Ser Cys 
340 345 350 

Asp Ser Asp Val Ser Val Gly Tyr Gly He Gin Glu Tyr Ala Ser Thr 
355 360 365 

Val Val Gin Ala Val Gin Tyr Val Cys Asp Arg Lys Gly Val Lys His 
370 375 380 
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Pro Val He Cys Ser Glu Ser Gly Arg Ala . He Val Ser His His Ser 
385 390 395 400 

He Leu He Phe Glu Ala Val Ser Ala Ser Ser His Ser Cys Ser Ser 
405 410 415 

Ser His Leu Ser Ser Gly Gly Leu Gin Ser Met Ala Glu Thr Leu Asn 
420 425 430 

Glu Asp Ala Leu Ala Asp Tyr Arg Asn Leu Ser Ala Ala Ala Val Arg 
435 440 445 

Gly Glu Tyr Glu Thr Cys Val Leu Tyr Ser Asp Gin Leu Lys Gin Arg 
450 455 460 

Cys Val Asp Gin Phe Lys Glu Gly Ser Leu Gly He Glu His Leu Ala 
465 470 475 480 

Ala Val Asp Ser He Cys Asp Phe Val Ser Lys Ala Met Gly Ala Ala 
485 490 _ 495 

Asp Pro He Arg Thr Tyr His Val Asn Leu Ser He Phe Thr Ser He 
500 505 510 

Pro Asp Phe Trp Ala Phe Gly Gin Leu Phe Pro He Val Pro He His 
515 520 525 

Arg Leu Asp Glu Lys Pro Ala Val Arg Gly He Leu Ser Asp Leu Thr 
530 535 540 

Cys Asp Ser Asp Gly Lys Val Asp Lys Phe He Gly Gly Glu Ser Ser 
545 550 555 560 

Leu Gin Leu His Glu Leu Gly Ser Asn Gly Asp Gly Gly Gly Tyr Tyr 
565 570 575 

Leu Gly Met Phe Leu Gly Gly Ala Tyr Glu Glu Ala Leu Gly Gly Leu 
580 585 590 

His Asn Leu Phe Gly Gly Pro Ser Val Val Arg Val Val Gin Ser Asp 
595 600 605 

Ser Ala His Ser Phe Ala Met Ser Arg Ser Val Pro Gly Pro Ser Cys 
610 615 620 

Ala Asp Val Leu Arg Ala Met Gin His Glu Pro Glu Leu Met Phe Glu 
625 630 635 640 
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Thr Leu Lys His Arg Ala Glu Glu Phe Leu Glu Gin Glu Glu Asp Lys 
645 650 655 

Gly Leu Ala lie Ala Ser Leu Ala Ser Ser Leu Ala Gin Ser Phe His 
660 665 670 

Asn Met Pro Tyr Leu Val Ala Pro Ala Ser Cys Cys Phe Thr Ala Val 
675 680 685 

Thr Ala Asn Asn Gly Gly Tyr Asn Tyr Tyr Tyr Ser Asp Glu Asn Ala 
690 695 700 



Ala Asp Ser Ala Thr Gly Glu Asp Glu lie Trp Ser Tyr Cys Thr Ala 
705 710 715 720 



<210> 20 
<211> 2118 
<212> DNA 
<213> Plant 

<400> 20 

gaattcctta tccggatttc tggtacgcag actgtaatat ggagtcatct tctcctcgat 60 

tcgggattaa aattaggtga cttgggacac cctaaatctc ccaagtggcg actctgaaat 120 

aaataaacaa atcccgtttc gattgtcctt aaattggaaa aaactccctt gtaccctccc 180 

gggtacggaa aaaggaggtg tacagcaatg acccaaaact tttattgcta tacattttga 240 

ggaatcaact tgatcaaaat ttatgggtga aattcaatgt ggtatgattt atattaggtc 300 

ggactttagc agatgtggtc acttcaattt gcggcaaaaa taatgtacag ggataataat 360 

aaaaagtact agaaatttga gtcataaagc tttttcaatt ttacaaaaga tattaagata 420 

cttattaaat caaatgtact ttattaatgt aatagcatga aaaaacagcc tcatccgcct 480 

gtcctcaccc cacaaaaagg agatagagaa aggaaactaa tcttatttaa ttttccacat 540 

ataaaattta tatccttgta taaatcccca aaaaaaaaaa atcaatacta attattttaa 600 

tttaatcatc cgtataagaa agaagctaat taactgactt acaaactgaa tagatagcac 660 

aatagcactc tcaattacaa aaatccaaag ccgagggtca ttcctttcat caagaaatta 720 

gatagggaat ggaaaatata atttaattat ctgaatcttt ataatttatc cttccatata 780 

agaaaaagga aacaaattaa ctgaagagca tatagcctcg catagattta ccttctccat 840 

atgggtgggg aaaccgacaa accgcaccaa ttcgataatt cgagtcaaac tgaggaaaaa 900 

aaaattcgac tatggtttgg tttgatttgg tttattgttg ggataaaaaa tcgatcataa 960 

ttggtttggt ttggttttaa ctaaagaaag tcaaaccgaa accaaacaaa cccgacatta 1020 

catatataaa ttttttagat atatttaata tataaatata cttgttgtga tgtaatttat 1080 

aaatatttct taaaaatatt cataatttta tcttttaaga tattatttcg tacttagaac 1140 

ttttgaatgt ttcttactcc tctttaattt gttagaaatt aatttgaagg agtttgaatt 1200 

tgctccgccc ccgctccgtc ccgttgccat ccctgactca ataggataac agcaatctcg 1260 

attgcaaaaa tccaaaccca aggaaccttc ccaacattac ataagctaca aagtagagta 1320 
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gtttattaaa taactaccaa tatatcctca aattctcgcg attatttcat acctaacacg 1380 

cttaccttat cttctcgtaa tgacgctaca ttagttggtg atataaaata ccgaatttgc 1440 

cacgcggcaa tcctccgctg tctatccacg gcccgagaga atctcttagc cccccaaaga 1500 

tgaaaattaa cttctagaat tttattttct ggttattacc atgaaaataa ttaaataaaa 1560 

aaaaagagaa aagtaaagat atttaattgg gctaaaactg gggtccacgg cccagccacg 1620 

catttccctc ctatataaag cgtcgtcacc tctcatgcaa atctcgctca ctacacagtt 1680 

gttagtttca cgttctcttc tcaattccca taacagaaac ccttccgtta ggtttccgtc 1740 

ctatttttcc tcatcttctc cgtttcctct tctgaaatca atatctgtat ggtgtttttc 18 00 

ttgttcgaat tttagatttg ttttgtcttt aatacctata accttaaatt ctctgtttaa 18 60 

accaaaaact tagcttcttc tgaagtcagg gtggggattt ttggatcgtg taagagtgtg 192 0 

ttagagggtg attatctttt gattcagttc cttttttgct tcttttgagg gggtagccgg 198 0 

ggcctcggcc tcggcgggtt ttaatagccc ccatctatta caactattgg gcaaaaacat 2040 

cattaaatct gtacaaaaca aacccttaat ttagtttaat tttctgtatt cattgatttt 2100 
ttaacagaag aagaagag 2118 

<210> 21 
<211> 4368 
<212> DNA 
<213> Plant 

<400> 21 

gaattcctta tccggatttc tggtacgcag actgtaatat ggagtcatct tctcctcgat 60 
tcgggattaa aattaggtga cttgggacac cctaaatctc ccaagtggcg actctgaaat 120 
aaataaacaa atcccgtttc gattgtcctt aaattggaaa aaactccctt gtaccctccc 180 
gggtacggaa aaaggaggtg tacagcaatg acccaaaact tttattgcta tacattttga 240 
ggaatcaact tgatcaaaat ttatgggtga aattcaatgt ggtatgattt atattaggtc 300 
ggactttagc agatgtggtc acttcaattt gcggcaaaaa taatgtacag ggataataat 360 
aaaaagtact agaaatttga gtcataaagc tttttcaatt ttacaaaaga tattaagata 420 
cttattaaat caaatgtact ttattaatgt aatagcatga aaaaacagcc tcatccgcct 480 
gtcctcaccc cacaaaaagg agatagagaa aggaaactaa tcttatttaa ttttccacat 540 
ataaaattta tatccttgta taaatcccca aaaaaaaaaa atcaatacta attattttaa 600 
tttaatcatc cgtataagaa agaagctaat taactgactt acaaactgaa tagatagcac 660 
aatagcactc tcaattacaa aaatccaaag ccgagggtca ttcctttcat caagaaatta 720 
gatagggaat ggaaaatata atttaattat ctgaatcttt ataatttatc cttccatata 780 
agaaaaagga aacaaattaa ctgaagagca tatagcctcg catagattta ccttctccat 840 
atgggtgggg aaaccgacaa accgcaccaa ttcgataatt cgagtcaaac tgaggaaaaa 900 
aaaattcgac tatggtttgg tttgatttgg tttattgttg ggataaaaaa tcgatcataa 960 
ttggtttggt ttggttttaa ctaaagaaag tcaaaccgaa accaaacaaa cccgacatta 1020 
catatataaa ttttttagat atatttaata tataaatata cttgttgtga tgtaatttat 1080 
aaatatttct taaaaatatt cataatttta tcttttaaga tattatttcg tacttagaac 1140 
ttttgaatgt ttcttactcc tctttaattt gttagaaatt aatttgaagg J.agtttgaatt 1200 
tgctccgccc ccgctccgtc ccgttgccat ccctgactca ataggataac agcaatctcg. 1260 
attgcaaaaa tccaaaccca aggaaccttc ccaacattac ataagctaca aagtagagta 1320 
gtttattaaa taactaccaa tatatcctca aattctcgcg attatttcat acctaacacg 1380 
cttaccttat cttctcgtaa tgacgctaca ttagttggtg atataaaata ccgaatttgc 1440 
cacgcggcaa tcctccgctg tctatccacg gcccgagaga atctcttagc cccccaaaga 1500 
tgaaaattaa cttctagaat tttattttct ggttattacc atgaaaataa ttaaataaaa 1560 
aaaaagagaa aagtaaagat atttaattgg gctaaaactg gggtccacgg cccagccacg 1620 
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catttccctc ctatataaag cgtcgtcacc tctcatgcaa atctcgctca ctacacagtt 1680 

gttagtttca cgttctcttc tcaattccca taacagaaac ccttccgtta ggtttccgtc 1740 

ctatttttcc tcatcttctc cgtttcctct tctgaaatca atatctgtat ggtgtttttc 1800 

ttgttcgaat tttagatttg ttttgtcttt aatacctata accttaaatt ctctgtttaa 1860 

accaaaaact tagcttcttc tgaagtcagg gtggggattt ttggatcgtg taagagtgtg 1920 

ttagagggtg attatctttt gattcagttc cttttttgct tcttttgagg gggtagccgg 1980 

ggcctcggcc tcggcgggtt ttaatagccc ccatctatta caactattgg gcaaaaacat 2040 

cattaaatct gtacaaaaca aacccttaat ttagtttaat tttctgtatt cattgatttt 2100 

ttaacagaag aagaagagat gccggcccta ggttgttgtg tagatgctgc tgttgtttcc 2160 

cctcctctca gctatgcctt ctctcgggat agctctcttc ccgcgccgga gttctttgcc 2220 

tccggcgtac ctccgacaaa ttctgccgct gcttccattg ggtctccgga tttgtcgtct 2280 

gctttatacg gggtcgatgg gtggggagct ccttatttct ctgttaactc taatggagat 2340 

atctccgtcc gaccacacgg tacggacact ctccctcacc aggaaattga ccttctcaag 2400 

gtcgtgaaaa aggcctccga cccgaaaaat tcaggtgggc ttgggcttca gctgcctctt 24 60 

gttgttcgct tccctgatgt gttgaaaaac cggttggaat ctctgcaatc ggcttttgat 2520 

ctcgcggttc attcccaggg ctatggggcc cactaccaag gtgtttatcc cgtgaaatgc 2580 

aatcaagaca ggttcgtggt ggaagatatc gtgaaattcg ggtcgccatt ccggttcggg 2 640 

ttggaagccg ggtctaaacc cgagctcctg ttagccatga gctgtctctg caagggcagt 27 00 

gctgagggcc ttctcgtttg caatggtttc aaggacgctg agtacatttc gcttgctttg 27 60 

gttgcaagaa agctcatgtt aaacactgta attgtgcttg aacaagagga ggagcttgac 2 820 

cttgtgattg atataagcca taagatggct gttcggcctg taattggact tcgggctaag 2880 

ctcaggacca agcattcagg ccattttgga tccacttctg gagaaaaagg taagtttggg 2940 

cttacaacga cccaaattgt tcgtgtggtg aagaagctag aagaatccgg aatgctggat 3000 

tgtcttcagt . tgctgcattt tcacattgga tctcagatcc cttctacggg gttgctagct 3060 

gatggagttg gtgaggccgc tcagatttat tgtgaattag tccgtcttgg agcgggtatg 3120 

aagttcattg atattggagg tgggcttgga attgattatg atggtactaa atcatgcgat 3180 

tctgatgtct ctgttggcta tggcattcaa gaatatgcct ccgcagttgt: tcaggcggtt 3240 

caatatgtat gcgaccgtaa gggtgtgaaa cacccagtga tctgcagcga aagtggcagg 3300 

gcaattgttt ctcatcactc aattctgatt ttcgaagccg tgtctgcttc tagtcactca 3360 

tgttcttctt cacatctgtc ttctggtggc ctccaatcca tggcggagac gctcaacgaa 3420 

gatgcccttg ctgattaccg caatttatct gctgctgcag ttcgtggaga gtatgagaca 3480 

tgtgtacttt actctgatca gttgaaacag agatgtgtgg atcagtttaa agaagggtcc 3540 

ttgggtattg aacatcttgc tgctgttgat agcatctgtg attttgtatc aaaggctatg 3600 

ggggctgctg atcctgtccg cacttaccat gtgaatctgt caattttcac ttcaattcct 3660 

gatttttggg cctttggtca attgtttccg attgttccaa ttcaccgctt agatgaaaag 3720 

cctgcagtga ggggaatatt atcggactta acttgtgaca gtgatgggaa ggttgataag 3780 

ttcattggtg gcgaatcaag cttgccgcta catgaattgg gaagtaatgg cgatggtggt 3840 

ggttattatc tggggatgtt tttgggtggg gcttatgagg aggcgctcgg aggactccac 3900 

aacctgtttg gtggaccaag tgtcgtgcgc gtggtgcaga gcgatagcgc tcacagcttt 3960 

gccatgactc gctccgtccc tggcccgtct tgcgctgatg tgctccgagc gatgcagcac 4020 

gagcccgagc tcatgttcga gactctcaag caccgtgcgg aggaattctt ggaacaagaa 4080 

gatgacaaag ggctggctgt tgaatctttg gccagcagcg tagctcagtc cttccataac 4140 

atgccttacc ttgtggcgcc ttcatcttgc cgcttcactg ctgctactga taacaatggt 4200 

ggctataatt actattacag tgatgagaat gcagcagatt ctgctacagg ggaggatgag 42 60 

atttggtcct attgcactgc ttgaagtgtt ctcgtagcat ctccagtctt agtttgtcgt 4320 

cgaggttgtc tgtttttgaa taataccctt agttggtgat gtttttct 4368 

<210> 22 
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<211> 721 
<212> PRT 
<213> Plant 



<400> 22 

Met Pro Ala Leu Gly Cys Cys Val Asp Ala Ala Val Val Ser Pro Pro 
15 10 15 

Leu Ser Tyr Ala Phe Ser Arg Asp Ser Ser Leu Pro Ala Pro Giu Phe 
20 25 30 

Phe Ala Ser Gly Val Pro Pro Thr Asn Ser Ala Ala Ala Ser lie Gly 
35 40 45 

Ser Pro Asp Leu Ser Ser Ala Leu Tyr Gly Val Asp Gly Trp Gly Ala 
50 55 60 

Pro Tyr Phe Ser Val Asn Ser Asn Gly Asp He Ser Val Arg Pro His 
65 70 75 80 

Gly Thr Asp Thr Leu Pro His Gin Glu He Asp Leu Leu Lys Val Val 
85 90 95 

Lys Lys Ala Ser Asp Pro Lys Asn Ser Gly Gly Leu Gly Leu Gin Leu 
100 105 HO 

Pro Leu Val Val Arg Phe Pro Asp Val Leu Lys Asn Arg Leu Glu Ser 
115 120 125 

Leu Gin Ser Ala Phe Asp Leu Ala Val His Ser Gin Gly Tyr Gly Ala 
130 135 140 

His Tyr Gin Gly Val Tyr Pro Val Lys Cys Asn Gin Asp Arg Phe Val 
145 150 155 160 

Val Glu Asp He Val Lys Phe Gly Ser Pro Phe Arg Phe Gly Leu Glu 
165 170 175 

Ala Gly Ser Lys Pro Glu Leu Leu Leu Ala Met Ser Cys Leu Cys Lys 
180 185 190 

Gly Ser Ala Glu Gly Leu Leu Val Cys Asn Gly Phe Lys Asp Ala Glu 
195 200 205 

Tyr He Ser Leu Ala Leu Val Ala Arg Lys Leu Met Leu Asn Thr Val 
210 215 220 

He Val Leu Glu Gin Glu Glu Glu Leu Asp Leu Val He Asp He Ser 
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240 



His Lys Met Ala Val Arg Pro Val lie Gly Leu Arg Ala Lys Leu Arg 
245 250 255 

Thr Lys His Ser Gly His Phe Gly Ser Thr Ser Gly Glu Lys Gly Lys 
260 265 270 

Phe Gly Leu Thr Thr Thr Gin lie Val Arg Val Val Lys Lys Leu Glu 
275 280 285 

Glu Ser Gly Met Leu Asp Cys Leu Gin Leu Leu His Phe His lie Gly 
290 295 300 



Ser Gin lie Pro Ser Thr Gly Leu Leu Ala Asp Gly Val Gly Glu Ala 
305 310 315 320 

Ala Gin He Tyr Cys Glu Leu Val Arg Leu Gly Ala Gly Met Lys Phe 
325 330 335 

He Asp He Gly Gly Gly Leu Gly He Asp Tyr Asp Gly Thr Lys Ser 
340 345 350 

Cys Asp Ser Asp Val Ser Val Gly Tyr Gly He Gin Glu Tyr Ala Ser 
355 360 365 - 

Ala Val Val Gin Ala Val Gin Tyr Val Cys Asp Arg Lys Gly Val Lys 
370 375 380 

His Pro Val He Cys Ser Glu Ser Gly Arg Ala He Val Ser His His 
385 390 395 400 

Ser He Leu He Phe Glu Ala Val Ser Ala Ser Ser His Ser Cys Ser 
405 410 415 

Ser Ser His Leu Ser Ser Gly Gly Leu Gin Ser Met Ala Glu Thr Leu 
420 425 430 



Asn Glu Asp Ala Leu Ala Asp Tyr Arg Asn Leu Ser Ala Ala Ala Val 
435 440 445 

Arg Gly Glu Tyr Glu Thr Cys Val Leu Tyr Ser Asp Gin Leu Lys Gin 
450 455 460 

Arg Cys Val Asp Gin Phe Lys Glu Gly Ser Leu Gly He Glu His Leu 

470 475 480 

Ala Ala Val Asp Ser He Cys Asp Phe Val Ser Lys Ala Met Gly Ala 
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485 

Ala Asp Pro Val Arg 
500 

lie Pro Asp Phe Trp 
515 

His Arg Leu Asp Glu 
530 

Thr Cys Asp Ser Asp 
545 

Ser Leu Pro Leu His 
565 

Tyr Leu Gly Met Phe 
580 

Leu His Asn Leu Phe 
595 

Asp Ser Ala His Ser 
610 

Cys Ala Asp Val Leu 
625 

Glu Thr Leu Lys His 
645 

Lys Gly Leu Ala Val 
660 

His Asn Met Pro Tyr 
675 

Ala Thr Asp Asn Asn 
690 

Ala Ala Asp Ser Ala 
705 

Ala 



490 

Thr Tyr His Val Asn Leu 
505 

Ala Phe Gly Gin Leu Phe 
520 

Lys Pro Ala Val Arg Gly 
535 

Gly Lys Val Asp Lys Phe 
550 555 

Glu Leu Gly Ser Asn Gly 
570 

Leu Gly Gly Ala Tyr Glu 
585 

Gly Gly Pro Ser Val Val 
600 

Phe Ala Met Thr Arg Ser 
615- 

Arg Ala Met Gin His Glu 
630 635 

Arg Ala Glu Glu Phe Leu 
650 

Glu Ser Leu Ala Ser Ser 
665 

Leu Val Ala Pro Ser Ser 
680 

Gly Gly Tyr Asn Tyr Tyr 
695 

Thr Gly Glu Asp Glu lie 
710 715 



495 

Ser lie Phe Thr Ser 
510 

Pro lie Val Pro He 
525 

He Leu Ser Asp Leu 
540 

He Gly Gly Glu Ser 
560 

Asp Gly Gly Gly Tyr 
575 

Glu Ala Leu Gly Gly 
590 

Arg Val Val Gin Ser 
605 

Val Pro Gly Pro Ser 
620 

Pro Glu Leu Met Phe 
640 

Glu Gin Glu Asp Asp 
655 

Val Ala Gin Ser Phe 
670 

Cys Arg Phe Thr Ala 
685 

Tyr Ser Asp Glu Asn 
700 

Trp Ser Tyr Cys Thr 
720 
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<210> 23 
<211> 2695 
<212> DNA 
<213> Plant 

<400> 23 

ttcacgttct cttctcaatt cccataaaag 
tctcttcttc tacgcttcct cttctgatat 
attttagatt tgttttgcct ttaatacctg 
acttagcttc ttctgaagtc agggtgggga 
gtgattatct tttgattcag ttcctttttt 
gcctcggcgg gttttaatag cccccatcta 
tctgtacaaa gcaaaccctt aatttagttt 
aagaagaaga gatgccggcc ctaggttgtt 
gctatgcctt ctctcgggat agctctcttc 
ctcctacaaa ctccgccgcc ggttccattg 
gggtcgatgg gtggggagct ccttatttct 
gaccacatgg tacggacaca ctcccccacc 
aggcctccga cccgaaaaat tcaggggggc 
tccctgatgt gctaaaaaac cggttggaat 
attcccaggg ctatggggcc cactaccaag 
ggttcgtggt ggaagatatt gtcaaattcg 
ggtctaaacc cgagctcctg ttagccatga 
ttctcgtttg caatggtttc aaggacgctg 
agctcatgtt aaacactgta attgttcttg 
atataagccg taagatggct gttcggcccg 
agcattcagg ccattttgga tccacttctg 
cccaaattgt tcgtgtagtg aagaagctgg 
tgctgcattt tcacattgga tctcagatcc 
gtgaggctgc tcagatttat tgtgaattaa 
atactggagg tgggctcgga attgattatg 
ctgttggcta tggcattcaa gaatacgcct 
gcgaccgtaa gggcgtgaag cacccagtga 
ctcatcactc aattctgatt ttcgaagccg 
cacatctgtc ttctggtggc ctccaatcca 
ctgattaccg caatttatct gctgctgcag 
actctgatca gttgaaacag agatgtgtgg 
aacatcttgc tgctgttgat agcatctgtg 
atcctatccg cacttaccat gtgaatctgt 
cctttggtca attgtttccg attgttccaa 
ggggaatatt atcggacttg acttgtgaca 
gcgaatcaag cttgcagctg catgaattgg 
tggggatgtt tttgggtggg gcttatgagg 
gtggaccaag cgtggtgcgc gtggtgcaga 
gctccgtccc tggcccgtcc tgcgcggacg 
tcatgttcga gactctcaag caccgtgcgg 
ggctggccat tgcatctttg gccagcagct 
ttgtggcgcc tgcatcttgc tgcttcactg 
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aaacccttcc gttaggtttc cgtcctattt 60 
caatatctgt atggtgtttt tcttgttcga 120 
taaccttata attctctgtt taaaccaaaa 180 
tatttggatc gtgtaagagt gtgttagaag 240 
gcttcttttg agggggtagc cggggcctcg 300 
ttacaaccat tgggcaaaaa catcattaaa 360 
aattttctgt attctttgat tctttaacag 420 
gcgtagacgc tactgtttcc cctcctctcg 430 
ccgcgccgga gttctttacc tccggcgtac 540 
ggtctccgga tctgtcctct gctttgtacg 600 
ccgttaactc taacggagat atctccgtcc 660 
aggaaattga ccttctcaag gtcgtgaaaa 720 
tcgggcttca gctgcctctt gttgttcgct 780 
ctctgcaatc ggcttttgat ctcgctgttc 840 
gtgtttatcc cgtgaaatgc aatcaagaca 900 
ggtcgtcatt ccggttcggg ttggaagctg 960 
gctgtctctg caggggcagt gctgagggcc 1020 
agtacatttc gcttgctttg gttgcaagaa 1080 
aacaagagga ggagcttgac cttgtgattg 1140 
taattggact tcgggctaag ctcaggacca 1200 
gagaaaaagg taagtttggg cttacaacga 1260 
aagaatccgg aatgctggat tgccttcagt 1320 
cttcaacggc gttgcttgct gatggtgttg 1380 
tccgtcttgg tgcgggtatg aagttcattg 1440 
atggtactaa atcatgtgat tcagatgtct 1500 
ccacagttgt ccaggcggtt caatatgttt 1560 
tttgcagcga aagtggcagg gcaattgttt 1620 
tgtctgcttc tagtcactca tgttcttctt 1680 
tggcggagac gctcaatgaa gatgcccttg 1740 
ttcgtggaga gtacgagacg tgtgtacttt 1800 
atcagtttaa agaagggtcc ttgggtattg 1860 
attttgtatc aaaggctatg ggggctgctg 1920 
caattttcac ttcaattcct gatttttggg 1980 
tacaccgttt agatgaaaag cctgcagtaa 2040 
gtgatgggaa ggttgataag ttcattggtg 2100 
gaagtaatgg cgatggtggt gggtattatc 2160 
aggcgctcgg aggactccac aacctgtttg 2220 
gcgatagcgc tcacagcttc gccatgtctc 2280 
tgctccgagc gatgcagcac gagcccgagc 2340 
aggaattctt ggaacaagaa gaagacaaag 2400 
tagctcagtc cttccataac atgccttacc 2460 
cagttactgc taacaacggt ggctataact 2520 
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actattacag tgatgagaat gcagcagatt ctgctacagg ggaggatgag atttggtcct 2580 
attgcactgc ttgaagtgtt gtcgtagcat ctccagtttt agtttgtcgt cgaagttgtc 2640 
tgtttttgaa taataccctt agttggtgat gtttttctaa aaaaaaaaaa aaaaa 2695 

<210> 24 

<211> 720 

<212> PRT 

<213> Plant 

<400> 24 

Met Pro Ala Leu Gly Cys Cys Val Asp Ala Thr Val Ser Pro Pro Leu 
15 10 15 

Gly Tyr Ala Phe Ser Arg Asp Ser Ser Leu Pro Ala Pro Glu Phe Phe 
20 25 30 

Thr Ser Gly Val Pro Pro Thr Asn Ser Ala Ala Gly Ser lie Gly Ser 
35 40 45 

Pro Asp Leu Ser Ser Ala Leu Tyr Gly Val Asp Gly Trp Gly Ala Pro 
50 55 60 

Tyr Phe Ser Val Asn Ser Asn Gly Asp lie Ser Val Arg Pro His Gly 
65 70 75 80 

Thr Asp Thr Leu Pro His Gin Glu He Asp Leu Leu Lys Val Val Lys 
85 90 95 

Lys Ala Ser Asp Pro Lys Asn Ser Gly Gly Leu Gly Leu Gin Leu Pro 
100 105 110 

Leu Val Val Arg Phe Pro Asp Val Leu Lys Asn Arg Leu Glu Ser Leu 
115 120 125 

Gin Ser Ala Phe Asp, Leu Ala Val His Ser Gin Gly Tyr Gly Ala His 
130 135 140 

Tyr Gin Gly Val Tyr Pro Val Lys Cys Asn Gin Asp Arg Phe Val Val 
145 150 " 155 160 

Glu Asp He Val Lys Phe Gly Ser Ser Phe Arg Phe Gly Leu Glu Ala 
165 170 175 

Gly Ser Lys Pro Glu Leu Leu Leu Ala Met Ser Cys Leu Cys Arg Gly 
180 185 190 

Ser Ala Glu Gly Leu Leu Val Cys Asn Gly Phe Lys Asp Ala Glu Tyr 
195 200 205 
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lie Ser Leu Ala Leu Val Ala Arg Lys Leu Met Leu Asn Thr Val lie 
210 215 220 

Val Leu Glu Gin Glu Glu Glu Leu Asp Leu Val lie Asp lie Ser Arg 
225 230 235 240 

Lys Met Ala Val Arg Pro Val lie Gly Leu Arg Ala Lys Leu Arg Thr 
245 250 255 

Lys His Ser Gly His Phe Gly Ser Thr Ser Gly Glu Lys Gly Lys Phe 
260 265 270 

Gly Leu Thr Thr Thr Gin He Val Arg Val Val Lys Lys Leu Glu Glu 
275 280 285 

Ser Gly Met Leu Asp Cys Leu Gin Leu Leu His Phe His He Gly Ser 
290 295 300 

Gin He Pro Ser Thr Ala Leu Leu Ala Asp Gly Val Gly Glu Ala Ala 
305 310 315 320 

Gin He Tyr Cys Glu Leu He Arg Leu Gly Ala Gly Met Lys Phe He 
325 330 335 

Asp Thr Gly Gly Gly Leu Gly He Asp Tyr Asp Gly Thr Lys Ser Cys 
340 345 350 

Asp Ser Asp Val Ser Val Gly Tyr Gly He Gin Glu Tyr Ala Ser Thr 
355 360 365 

Val Val Gin Ala Val Gin Tyr Val Cys Asp Arg Lys Gly Val Lys His 
370 375 380 

Pro Val He Cys Ser Glu Ser Gly Arg Ala He Val Ser His His Ser 
385 390 395 400 

He Leu He Phe Glu Ala Val Ser Ala Ser Ser His Ser Cys Ser Ser 
405 410 415 

Ser His Leu Ser Ser Gly Gly Leu Gin Ser Met Ala Glu Thr Leu Ash 
420 425 430 

Glu Asp Ala Leu Ala Asp Tyr Arg Asn Leu Ser Ala Ala Ala Val Arg 
435 440 445 

Gly Glu Tyr Glu Thr Cys Val Leu Tyr Ser Asp Gin Leu Lys Gin Arg 
450 455 460 



33 



WO 00/67558 PCT/US00/12450 



Cys Val Asp Gin Phe Lys Glu Gly Ser Leu Gly lie Glu His Leu Ala 
465 470 475 480 

Ala Val Asp Ser lie Cys Asp Phe Val Ser Lys Ala Met Gly Ala Ala 
485 490 495 

Asp Pro lie Arg Thr Tyr His Val Asn Leu Ser lie Phe Thr Ser lie 
500 505 510 

Pro Asp Phe Trp Ala Phe Gly Gin Leu Phe Pro lie Val Pro lie His 
515 520 525 

Arg Leu Asp Glu Lys Pro Ala Val Arg Gly lie Leu Ser Asp Leu Thr 
530 535 540 

Cys Asp Ser Asp Gly Lys Val Asp Lys Phe lie Gly Gly Glu Ser Ser 
545 550 555 560 

Leu Gin Leu His Glu Leu Gly Ser Asn Gly Asp Gly Gly Gly Tyr Tyr 
565 570 575 

Leu Gly Met Phe Leu Gly .Gly Ala Tyr Glu Glu Ala Leu Gly Gly Leu 
580 585 590 

His Asn Leu Phe Gly Gly Pro Ser Val Val Arg Val Val Gin Ser Asp 
595 600 605 

Ser Ala His Ser Phe Ala Met Ser Arg Ser Val Pro Gly Pro Ser Cys 
610 615 620 

Ala Asp Val Leu Arg Ala Met Gin His Glu Pro Glu Leu Met Phe Glu 
. 625 630 635 640 

Thr Leu Lys His Arg Ala Glu Glu Phe Leu Glu Gin Glu Glu Asp Lys 
645 650 655 

Gly Leu Ala lie Ala Ser Leu Ala Ser Ser Leu Ala Gin Ser Phe His 
660 665 670 

Asn Met Pro Tyr Leu Val Ala Pro Ala Ser Cys Cys Phe Thr Ala Val 
675 680 685 

Thr Ala Asn Asn Gly Gly Tyr Asn Tyr Tyr Tyr Ser Asp Glu Asn Ala 
690 695 700 

Ala Asp Ser Ala Thr Gly Glu Asp Glu lie Trp Ser Tyr Cys Thr Ala 
N 705 710 715 720 
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<210> 25 
<211> 914 
<212> DNA 
<213> Plant 

<400> 25 

aagctcgann ttaancctca ntaaagggaa caaaagctgg taccgnggcc ccccctcgag 60 

gtcgacggta tcgataagct tgattaagct tagtangcac attagcagcg cttgggatga 120 

ttttaggcgc ggcctattcc ctttggctat ataatcgtgt ggttctggga attaaaaccc 180 

gatttcctcc ataaattctc cgatctaaat ggcanagaat tttcatattt ataccttttc 240 

ttgttggagt tgtttggatg ggtgtttacc ccaaagtgtt cctgggactg catgcataca 300 

tccgtaagta acttaagtgc aacatggaaa atttcattga gaggaatcag caaaaaaaaa 360 

aagcttaatc gaattcctgc agcccggggg atccactagt tctagagcgg ccgccaccgc 420 

ggtggagctc caattcgccc tatagtgagt cgtattacaa ttcactgggc cgtcgtttta 480 

caacgtcgtg actgggaaan gcctggcgtt accaacttaa tcgccttgca gcacatcccc 540 

ctttcgccag ctgggcgtaa tagcgaanag gccgcacgat cgccttccca acagttgcgc 600 

acctgatggn gaatgggacn gccctgtanc ngcgcangaa ncgcggcngg tgtggtggta 660 

ccccancgtg acgcnaactt gcaacgccta acgccgcnct tcgcttctcc ttcttctngc 72 0 

aagttcncgg ctttccgtaa gtccaagcgg gggnccttag gttcgattat gttnggaccc 78 0 

ccccnaaact gttanggtnt gtgatttggc accccaaaac gtttcccttg nctggtcntt 840 

ttaaangcct ntcaacngaa accaccatcg gnacttgtta aggntncctt gcntgnaaaa 900 

* nngtaaatta nntn 914 

<210> 26 
<211> 829 
<212> DNA 
<213> Plant 

<400> 26 

agcaagctcg atatcgccct cactaaaggg aacaaaaact ggtaccgggc cccccctcga 60 

ggtcgacggt atcgataagc ttgattaanc tttttttttt tgaactacac aagggaattt 120 

cttctcctna gtaacacatg agaataatta gtgcaataaa ttacaagagg aacattgcag 180 

ttggatttaa gaatctgcgc tggggaattt agcctcaata tttgctacaa ccgtacagat 240 

ttcactgcat tcatgaacga tagtatccgt gacacatcct tttggatgcc gtcctgtcca 300 

catatgccac tactcacatc cactccattg ggtttaagtt gcagaaagag cttcacaaac 360 

attctccggg ttaattcctc ctgccaagag ccacccatgt ttgcttctaa tgcgcggcag 42 0 

cttaaactga acccagttga atcctttgcc actgccacct tttgcactat ccactagaac 480 

ccaatcaacc agagaagact cctcatcaga aatatagttc aaaaggctcc tcttcatttg 540 

catgaagtac gtatattaca cgtttttccc tgaccaaagc ttaatcgaat tctgcagccc 600 

gggggatcng gnattctaga gcggcgccac gcggtggagc tccaatcgcc taaatgancn 660 

ataaaatcac tggccgtcgt ttanacgncn ggacgggaaa cctgggtacc aacttaat-cg 720 

cctgnagcna tccccttcnc agcggngtan acgaaaggcc gncgattgcc tccanattgc 780 

cacnggatgg aanggacncc gtncgganga acngggggnn ggggtaccn 829 
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